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MLL TRANSLOCATIONS SPECIFY A DISTINCT 
GENE EXPRESSION PROFILE, DISTINGUISHING A UNIQUE LEUKEMIA 



RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Application No. 
5 60/306,103 filed on July 17, 2001. The entire teachings of the above application are 
incorporated herein by reference. 

GOVERNMENT SUPPORT 

The invention was supported, in whole or in part,"by a grant P01CA68484 
from the National Institutes of Health. The Government has certain rights in the 
10 invention. 

BACKGROUND OF THE INVENTION 

A subset of human acute leukemias with a decidedly unfavorable prognosis 
possess a chromosomal translocation involving the Mixed Lineage Leukemia (MLL, 
HEX, AU-1) gene on chromosome segment llq23. The leukemic cells, which 

1 5 typically have a lymphoblastic morphology, have been classified as Acute 

Lymphoblastic Leukemia (ALL). However, unlike the majority of childhood ALL, 
the presence of the MLL translocations often results in an early relapse after 
chemotherapy. As MLL translocations are typically found in leukemias of infants 
and chemotherapy-induced leukemia, it has remained uncertain whether host related 

20 fectors or tumor-intrinsic biological differences are responsible for the poor survival 
in patients with the translocations. Lymphoblastic leukemias with either rearranged 
or germline MLL are similar with respect to most morphological and histochemical 
characteristics, hnmunophenotypic differences associated with lymphoblasts 
bearing an MLL translocation include the lack of the eariy lymphocyte antigen 

25 CD10, expression of the proteoglycan NG2, and the propensity to co-express the 
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myeloid antigens CD 15 and CD65. This prompted the corresponding disease to be 
called Mixed Lineage Leukemia and suggested models, largely unresolved, in which 
the leukemia reflects disordered cell fate decisions or the transformation of a more 
multi-potential progenitor. 
5 Generally, therapeutic treatment is more successful when tailored to the 

specific type of leukemia. Thus, a need exists for accurate and efficient methods for 
diagnosis of leukemia and identification of subclasses of leukemias. 

SUMMARY OF THE INVENTION 

As described herein, MLL is significantly different from ALL and AML , as 
1 0 assessed by gene expression profiling. The expression profiles reported here reveal 
that lymphoblastic leukemias bearing MLL translocations display a remarkably ' 
uniform and highly distinct pattern that clearly distinguishes them from conventional 
ALL or AML and warrants designation as a distinct disease, MLL. 

In one embodiment, the invention relates to a method of diagnosing mixed 
1 5 lineage leukemia, acute lymphoblastic leukemia or acute myelogenous leukemia, 
comprising determining a gene expression profile of a gene expression product from 
at least one informative gene from one or more cells, wherein the cells are selected 
from the group consisting of mononuclear blood cells and bone marrow cells, and 
wherein the gene expression profile is correlated with mixed lineage leukemia, acute 
20 lymphoblastic leukemia or acute myelogenous leukemia In one embodiment, the 
gene expression product is RNA. In a preferred embodiment, the gene expression 
profile is determined utilizing specific hybridization probes. In a particularly 
preferred embodiment, the gene expression profile is determined utilizing 
oligonucleotide microarrays. In a preferred embodiment, the gene expression profile 
25 is determined utilizing antibodies. In particular embodiments, the informative 
gene(s) is selected from the group consisting of the genes in FIGs. 1 A, IB, 2A-2F, 
3A-3D, and 5, and Tables 1 and 2. 

The invention further relates to a method of diagnosing mixed lineage 
leukemia, acute lymphoblastic leukemia or acute myelogenous leukemia, comprising 
30 deterrnining a gene expression profile of mRNA from at least one informative gene, 
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wherein the mKNA is isolated from one or more cells of an individual selected from 
the group consisting of mononuclear blood cells and bone marrow cells; and 
comparing the obtained gene expression profile to a gene expression profile of a 
control sample selected from the group consisting of a mixed lineage leukemia 
5 sample, an acute lymphoblastic leukemia sample and an acute myelogenous 

leukemia sample, wherein the gene expression profile of the cell from the individual 
is indicative of mixed lineage leukemia, acute lymphoblastic leukemia or acute 
myelogenous leukemia. 

The invention also relates to a method of diagnosing mixed lineage 
10 leukemia, comprising determining a gene expression profile of a gene expression 
product from at least one informative gene from one or more cells selected from the 
group consisting of mononuclear cells and bone marrow cells, wherein the gene 
expression profile is correlated with mixed lineage leukemia. 

The invention further relates to a method of identifying a compound for use 
1 5 in treating mixed lineage leukemia, acute lymphoblastic leukemia, or acute 

myelogenous leukemia, comprising determining a gene expression profile of a gene 
expression product from at least one informative gene from one or more cells 
selected from the group consisting of mononuclear blood cells and bone marrow 
cells of an individual with mixed lineage leukemia, acute lymphoblastic leukemia, or 
20 acute myelogenous leukemia; administering a test agent to the individual; 

deterniining a gene expression profile of a gene expression product from at least one 
informative gene from one or more cells selected from the group consisting of 
mononuclear blood cells and bone marrow cells from the individual; and comparing 
the two gene expression profiles, wherein if the gene expression profile from the 
25 individual after adniinistration of the agent is correlated with effective treatment of 
mixed lineage leukemia, acute lymphoblastic leukemia, or acute myelogenous 
leukemia, the test agent is a therapeutic agent. In one embodiment, the disease is 
mixed lineage leukemia, and a decrease in the expression of the informative gene 
selected from the group consisting of FLT3, MEIS1, and HoxA9, is indicative of 
30 effective treatment of mixed lineage leukemia. In another embodiment, the gene 
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expression profiles compared prior to and after administration of the test agent 
consist of one or more of the same informative genes. 

The invention also relates to a method for evaluating drug candidates for 
their effectiveness in treating mixed lineage leukemia, acute lymphoblastic 
5 leukemia, or acute myelogenous leukemia, comprising contacting a cell sample or 
lysate thereof with a candidate compound, wherein the cell is selected from the 
group consisting of mononuclear blood cells and bone marrow cells; , and detecting 
an alteration of a gene expression profile of a gene expression product from at least 
one informative gene from the cell sample or lysate thereof, wherein a compound 
10 that increases the gene expression profile of at least one informative gene which is 
decreased in mixed lineage leukemia, acute lymphoblastic leukemia, or acute 
myelogenous leukemia is a compound for use in treating mixed lineage leukemia, 
acute lymphoblastic leukemia, or acute myelogenous leukemia. 

The invention further relates to a method of identifying a compound for use 
15 in treating mixed lineage leukemia, acute lymphoblastic leukemia, or acute 

myelogenous leukemia, comprising contacting a cell sample or lysate thereof with a 
candidate compound, wherein the cell is selected from the group consisting of 
mononuclear blood cells and bone marrow cells; and detecting an alteration of a 
gene expression profile of a gene expression product from at least one informative 
gene from the cell sample or lysate thereof, wherein a compound that decreases the 
gene expression profile of at least one informative gene which is increased in mixed 
lineage leukemia, acute lymphoblastic leukemia, or acute myelogenous leukemia is a 
compound for use in treatingmixed lineage leukemia, acute lymphoblastic leukemia, 
or acute myelogenous leukemia. 

The invention further relates to a method of identifying a compound for use 
in treating mixed lineage leukemia, acute lymphoblastic leukemia, or acute 
myelogenous leukemia, comprising contacting a cell sample or lysate thereof with a 
candidate compound, wherein the cell is selected from the group consisting of 
mononuclear blood cells and bone marrow cells; and detecting an alteration of a 
gene expression profile of a gene expression product from at least one informative 
gene from the cell sample or lysate thereof, wherein a compound that increases the 
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gene expression profile of at least one informative gene which is decreased in mixed 
lineage leukemia, acute lymphoblastic leukemia, or acute myelogenous leukemia is a 
compound for use in treating acute lymphoblastic leukemia. In a preferred 
embodiment, the disease is mixed lineage leukemia, and the informative gene is 
5 selected from the group consisting of FLT3, MEJSJ, and HoxA9. 

In another aspect, the invention relates to a method of identifying a 
compound that modulates (increases or decreases) the biological activity of an 
informative gene. 

In. still another aspect, the invention features a method of identifying a 
10 compound that decreases the biological activity of an informative gene expression 
product having increased expression in MLL, AML, or ALL. The method comprises 
contacting the informative gene expression product with a candidate compound 
under conditions suitable for activity of the informative gene expression product; 
and assessing the biological activity level of the informative gene expression 
1 5 product. A candidate compound that decreases the biological activity level of the 
informative gene expression product relative to a control is a compound that 
decreases the biological activity of the informative gene expression product having 
increased expression in MLL, AML, or ALL. In one embodiment, the method is 
carried out in a cell or animal. In another embodiment, the method is carried out in a 
20 cell-free system. In still another embodiment the informative gene expression 

product is selected from the gene expression products encoded by the genes in FIGs. 
1 A, IB, 2A-2F, 3A-3D, and 5, and Tables 1 and 2. 

In another aspect, the invention features a method of identifying a compound 
that increses the biological activity of an informative gene expression product 
25 having decreased expression in MLL, AML, or ALL The method comprises 
contacting the informative gene expression product with a candidate compound 
under conditions suitable for biological activity of the informative gene expression 
product; and assessing the biological activity level of the informative gene 
expression product A candidate compound that increases the biological activity 
30 level of the informative gene expression product relative to a control is a compound 
that increases the biological activity of the informative gene expression product 
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having decreased expression in MLL, AML, or ALL. In one embodiment, the 
method is carried out in a cell or animal. In another embodiment, the method is 
carried out in a cell-free system. In still another embodiment the informative gene 
expression product is selected from the gene expression products encoded by the 
5 genes in FIGs. 1 A, IB, 2A-2F, 3A-3D, and 5, and Tables 1 and 2. 

In other embodiments, screens can be carried out for compounds that further 
increase the expression of a gene or the biological activity of a gene expression 
product already overexpressed in MLL, ALL, or AML, or that further decrease the 
expression of a gene or the biological activity of a gene expression product already 

10 underexpressed in MLL, ALL, or AML. These compounds can be identified 
according the screening methods described herein. These compounds should be 
avoided during treatment regimens for MLL, ALL, or AML. 

In still another aspect, the invention features a method of identifying a 
polypeptide that interacts with an informative gene expression product having 

1 5 increased or decreased expression in MLL, AML or ALL in a yeast two-hybrid 
system. The method comprises providing a first nucleic acid vector comprising a 
nucleic acid molecule encoding a DNA binding domain and a polypeptide encoded 
by the informative gene that is increased or decreased in MLL, AML, or ALL; 
providing a second nucleic acid vector comprising a nucleic acid encoding a 

20 transcription activation domain and a nucleic acid encoding a test polypeptide; 

contacting the first nucleic acid vector with the second nucleic acid vector in a yeast 
two-hybrid system; and assessing transcriptional activation in the yeast two-hybrid 
system. An increase in transcriptional activation relative to a control indicates that 
the test polypeptide is a polypeptide that interacts with the informative gene 

25 expression product having increased or decreased expression in MLL, AML or ALL. 

The invention also relates to compounds identified according to the above- 
described screening methods. Such compounds can be used to treat mixed lineage 
leukemia, acute lymphoblastic leukemia, or acute myelogenous leukemia, as 
appropriate. 

30 The invention further relates to a method for evaluating a drug candidate for 

effectiveness in treating mixed lineage leukemia, acute lymphoblastic leukemia, or 
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acute'myelogenous leukemia, comprising determining a gene expression profile of a 
gene expression product from at least one informative gene, wherein the gene 
expression product is isolated from cells derived from a blood or bone marrow 
sample from an individual to whom the drug candidate has been adnunistered, 
wherein the gene expression profile is indicative of the effectiveness of the drag 
candidate in treating mixed lineage leukemia, acute lymphoblastic leukemia, or 
acute myelogenous leukemia. 

The invention also relates to a method for monitoring the efficacy of a mixed 
lineage leukemia, acute lymphoblastic leukemia, or acute myelogenous leukemia 
treatment, comprising determining the gene expression profile a gene expression 
product from at least one informative gene in a cell from blood samples derived 
from an individual being treated, wherein the samples are obtained at various time 
points; and comparing the treatment outcome of the samples at various times during 
treatment, wherein the efficacy of mixed lineage leukemia, acute lymphoblastic 
leukemia, or acute myelogenous leukemia treatment is determined. In one 
embodiment the gene expression profiles obtained over time is compared to gene * 
expression profiles from individuals who do not have MIX, ALL, or AML (normal 
individuals). In another embodiment, the gene expression profiles determined at 
various time points include one or more of the same informative genes. 

The invention also encompasses a method of predicting the efficacy of 
treating mixed lineage leukemia, -acute lymphoblastic leukemia, or acute 
myelogenous leukemia, comprising determining a gene expression profile of a gene 
expression product from at least one informative gene, the gene expression product 
isolated from one or more cells selected from the group consisting of mononuclear 
cells and bone marrow cells of an individual with mixed lineage leukemia, acute 
lymphoblastic leukemia, or acute myelogenous leukemia, wherein the gene 
expression profile is correlated with a treatment outcome. In one embodiment the 
gene expression profiles obtained is compared to gene expression profiles from 
individuals who do not have MIX, ALL, or AML (normal individuals) 

The invention also relates to a method of treating mixed lineage leukemia, 
comprising administering to an individual in need thereof a therapeutic amount of an 
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agent that inhibits the activity of a gene product that is increased in mixed lineage 
leukemia. In a preferred embodiment, gene product is encoded by an informative 
gene selected from the group consisting of FLT3, MEIS1, and HoxA9. 

The invention further relates to a method of treating mixed lineage leukemia, 
5 comprising administering to an individual in need thereof a therapeutic amount of an 
agent which enhances the activity of a gene product which is decreased in mixed 
lineage leukemia. 

In any of the above methods, the gene expression product maybe RNA and 
the gene expression profile can be determined utilizing specific hybridization 
1 0 probes. In a particularly preferred embodiment, the gene expression profile is 

determined utilizing oligonucleotide microarrays. In another preferred embodiment, 
the gene expression profile is determined utilizing antibodies. In particular 
- embodiments, the informative gene(s) is selected from the group consisting of the 
genes in FIGs. 1 A, IB, 2A-2F, 3A-3D, and 5, and Tables 1 and 2. 
1 5 The invention also relates to an oligonucleotide microarray having 

immobilized thereon a plurality of oligonucleotide probes specific for one or more 
informative genes for diagnosing mixed lineage leukemia, acute lymphoblastic 
leukemia, or acute myelogenous leukemia wherein the informative genes are 
selected from the group consisting of the genes in FIGs. 1 A, IB, 2A-2F, 3A-3D, and 
20 5, and Tables 1 and 2. 

It is well known that proper diagnosis of disease is essential for successful 
treatment of individuals. The present invention will significantly improve the 
diagnosis of MLL, ALL, and ALL, and thereby improve the treatment of leukemic 
individuals. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

The patent or application file contains at least one drawing executed in color. 
Copies of this patent or patent application publication with color drawings will be 
provided by the Office upon request and payment of the necessary fee. 

FIG. 1 A illustrates genes that distinguish ALL from MLL. The 100 genes 
30 most highly correlated with the class distinction are shown. Each column represents 
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a leukemia sample and each row represents an individual gene. Expression levels 
are normalized for each gene where the mean is 0. Expression levels greater than 
the mean are shown in red, whereas levels less than the mean are shown in blue. 
Increasing distance from the mean is represented by increasing color intensity. 
5 FIG. IB illustrates genes that distinguish ALL (left-most 20 columns) from 

MIL (right-most 1 7 columns). The 1 00 genes most highly correlated with the class 
distinction are shown. Each column represents a leukemia sample and each row 
represents an individual gene. Expression levels are normalized for each gene where 
the mean is 0. Expression levels greater than the mean are shown in red, whereas 
1 0 levels less than the mean are shown in blue. Increasing distance from the mean is 
represented by increasing color intensity. 

FIG. 2A illustrates selected early lymphocyte gene expression in ALL and 
MIL. Relative levels of expression of CDIO in ALL and MIL samples are shown. 
Each bar represents an individual leukemia sample. The expression values are raw 
15 data obtained from Affymetrix GENECHIP® analysis after scaling of the arrays 
based on the scaling described in the Examples. 

FIG. 2B illustrates selected early lymphocyte gene expression in ALL and 
MLL Relative levels of expression of CD19 in ALL and MIL samples are shown. 
Each bar represents an individual leukemia sample. The expression values are raw 
20 data obtained from Affymetrix GENECHIP® analysis after scaling of the arrays 
based on the scaling described in the Examples. 

FIG. 2C illustrates selected early lymphocyte gene expression in ALL and 
MLL Relative levels of expression of IgB in ALL and MLL samples are shown. 
Each bar represents an individual leukemia sample. The expression values are raw 
25 data obtained from Affymetrix GENECHIP® analysis after scaling of the arrays 
based on the scaling described in the Examples. 

FIG. 2D illustrates selected early lymphocyte gene expression in ALL and 
MLL. Relative levels of expression of CD24 in ALL and MIL samples are shown. 
Each bar represents an individual leukemia sample. The expression values are raw 
30 data obtained from Affymetrix GENECHIP® analysis after scaling of the airays 
based on the scaling described in the Examples. 
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FIG. 2E illustrates selected early lymphocyte gene expression in ALL and 
MLL. Relative levels of expression of CD43 in ALL and MLL samples are shown. 
Each bar represents an individual leukemia sample. The expression values are raw 
data obtained from Affymetrix GENECHIP® analysis after scaling of the arrays 
5 based on the scaling described in the Examples. 

FIG. 2F illustrates selected early lymphocyte gene expression in ALL and 
MLL. Relative levels of expression of CD44 in ALL and MLL samples are shown. 
Each bar represents an individual leukemia sample. The expression values are raw 
data obtained from Affymetrix GENECHIP® analysis after scaling of the arrays 
1 0 based on the scaling described in the Examples. 

FIG. 3 A illustrates selected HOXA9 gene expression in ALL and MLL. 
Relative levels of expression of HOXA9 in ALL and MLL samples are shown. The 
expression values are obtained using Affymetrix GENECHIP® analysis after scaling 
of the arrays as described in the Examples. 
1 5 FIG. 3B illustrates selected HOXA5 gene expression in ALL and MLL. 

Relative levels of expression of HOXA5 in ALL and MLL samples are shown. The 
expression values are obtained using Affymetrix GENECHIP® analysis after scaling 
of the arrays as described in the Examples. 

FIG. 3C illustrates selected HOXA4 gene expression in ALL and MLL. 
20 Relative levels of expression of HOXA4 in ALL and MLL samples are shown. The 
expression values are obtained using Affymetrix GENECHIP® analysis after scaling 
of the arrays as described in the Examples. 

FIG. 3D illustrates selected HOXA 7 gene expression in ALL and MLL. 
Relative levels of expression of HOXA 7 in ALL and MLL samples are shown. The 
25 expression values are obtained using Affymetrix GENECHIP® analysis after scaling 
of the arrays as described in the Examples. 

FIG. 4A illustrates the comparison of gene expression between ALL, MLL 
and AML, and shows the principal component analysis (PCA) plot of ALL (red), 
MLL (blue), and AML (yellow) performed using 8700 genes that passed filtering. 
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FIG. 4B illustrates the comparison of gene expression between ALL, MLL 
and AML, and shows the PCA plot comparing ALL (red), MLL (blue), and AML 
(yellow) using the 500 genes that best distinguished ATT, from AML. 

FIG. 5 illustrates genes specifically expressed in MLL, ALL or AML. The 
5 top 15 genes, and their corresponding GenBank Accession Numbers, that are most 
highly correlated with one type of leukemia versus the other two are shown. Each 
column represents a leukemia sample and each row a gene. The relative levels of 
expression are shown in red (relatively high) and blue (relatively low) as described 
inFIGs. lAandlBl. 

10 FIG. 6 illustrates the classification of ALL, MLL and AML based on gene 

expression profile through a plot showing the error rate in class prediction using a 
cross-validation approach. One sample was withheld, and the class membership of 
this sample predicted based on gene expression levels. The genes used are the top 1- 
250 genes that are best correlated with the ALL/MLL/AML three-class distinction. 

15 

DETAILED DESCRIPTION OF THE INVENTION 

Early and accurate diagnosis of disease is of paramount importance in 
rendering effective treatment. The present invention relates to the diagnosis of 
mixed lineage leukemia (MLL), acute lymphoblastic leukemia (ALL), and acute 

20 myelogenous leukemia (AML) according to the gene expression profile of a sample 
from an individual, as well as to methods of therapy and screening that utilize the 
genes identified herein as targets. 

hi one embodiment, the present invention is directed to a method of 
diagnosing mixed lineage leukemia, acute lymphoblastic leukemia and acute 

25 myelogenous leukemia, comprising isolating a gene expression product from at least 
one informative gene from one or more cells of an individual selected from the 
group consisting of mononuclear blood cells and bone marrow cells; and 
determining a gene expression profile of at least one informative gene, wherein the 
gene expression profile is correlated with mixed lineage leukemia, acute 

30 lymphoblastic leukemia and acute myelogenous leukemia. 
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In another embodiment, the present invention is directed toward a method of 
diagnosing mixed lineage leukemia, acute lymphoblastic leukemia and acute 
myelogenous leukemia, comprising isolating mRNA from one or more cells of an 
individual, wherein the cells are selected from the group consisting of mononuclear 
5 blood cells and bone marrow cells, determining a gene expression profile of at least 
one informative gene, and comparing the gene expression profile with a gene 
expression profile of a control sample selected from the group consisting of mixed 
lineage leukemia sample, acute lymphoblastic leukemia sample and acute 
myelogenous leukemia sample, wherein the gene expression profile obtained from 

10 the cells of the individual is indicative of mixed lineage leukemia, acute 
lymphoblastic leukemia or acute myelogenous leukemia. 

In one example of the above method, if the gene expression product obtained 
from the sample is similar to the gene expression product of MLL, then the 
individual is diagnosed as having MLL; and if the gene expression product obtained 

1 5 from the sample is similar to the gene expression product of ALL, then the 

individual is diagnosed as having ALL; and if the gene expression product obtained 
from the sample is similar to the gene expression product of AML, then the 
individual is diagnosed as having AML. Using similar methods, the diagnosis of • 
certain types of leukemias (MLL, ALL, or AML) can also be ruled out. 

20 "Gene expression profile" as used herein is defined as the level or amount of 

gene expression of particular genes as assessed by methods described herein. The 
gene expression profile can comprise data for one or more genes and can be 
measured at a single time point or over a period of time. 

As used herein, "gene expression products" are proteins, polypeptides, or 

25 nucleic acid molecules (e.g., mRNA, tRNA, rRNA, or cRNA) that result from 

transcription or translation of genes. The present invention can be effectively used 
to analyze proteins, peptides or nucleic acid molecules that are the result of 
transcription or translation. The nucleic acid molecule levels measured can be 
derived directly from the gene or, alternatively, from a corresponding regulatory 

30 gene or regulatory sequence element. All forms of gene expression products can be 
measured. Additionally, variants of genes and gene expression products including, 
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for example, spliced variants and polymorphic alleles, can be measured. Similarly, 
gene expression can be measured by assessing the level of protein or derivative 
thereof translated from mRNA. The sample to be assessed can be any sample that 
contains a gene expression product. Suitable sources of gene expression products, 
5 e.g., samples, can include intact cells, lysed cells, cellular material for deterrniniiig 
gene expression, or material containing gene expression products. Examples of such 
samples are brain, blood, bone marrow, plasma, lymph, urine, tissue, mucus, 
sputum, saliva or other cell samples. Methods of obtaining such samples are known 
in the art. In a preferred embodiment, mononuclear bloods cells are used. In 
1 0 another preferred embodiment, bone marrow tissue is used. 

In one embodiment, the gene expression product is a protein or polypeptide, 
m this embodiment the determination of the gene expression profile can be made 
using techniques for protein detection and quantitation known in the art. For 
example, antibodies specific for the protein or polypeptide can be obtained using 
15 methods which are routine in the art, and the specific binding of such antibodies to 
protein or polypeptide gene expression products can be detected and measured. 

The present invention also provides methods for classifying the sample. A 
sample can be classified in many ways including but not limited to leukemia 
subclass (e.g., ALL, AML, or MLL), response to a particular treatment, referred to 
20 herein as treatment outcome, or treatment efficacy. Informative genes include, but 
are not limited to, those-shown in FIGs. 1 A, IB, 2A-2F, 3A-3D, and 5, and Tables 1 
and 2. Using the methods described herein, expression of numerous genes can be 
measured simultaneously. The assessment of numerous genes provides for a more 
accurate evaluation of the sample because there are more genes that can assist in 
25 classifying the sample. 

In a preferred embodiment, the gene expression product is mRNA and the 
gene expression levels are obtained, e.g., by contacting the sample with a suitable 
microarray, and determining the extent of hybridization of the nucleic acid in the 
sample to the probes on the microarray. 
30 The gene expression value measured or assessed is the numeric value 

obtained from an apparatus that can measure gene expression levels. Gene 
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expression levels refer to the amount of expression of the gene expression product, 
as described herein. The values are raw values from the apparatus, or values that are 
optionally rescaled, filtered and/or normalized. Such data is obtained, for example, 
from a GeneChip® probe array or Microarray (Affymetrix, Ihc.)(U.S, Patent Nos. 
5 5,631,734, 5,874,219, 5,861,242, 5,858,659, 5,856,174, 5,843,655, 5,837,832, 
5,834,758, 5,770,722, 5,770,456, 5,733,729, 5,556,752, all of which are 
incorporated herein by reference in their entirety), and the expression levels are 
calculated with software (e.g., Affymetrix GENECHIP® software). Nucleic acids 
(e.g., mRNA) from a sample which has been subjected to particular stringency 
10 conditions hybridize to the probes on the chip. The nucleic acid to be analyzed (e.g., 
the target) is isolated, amplified and labeled with a detectable label, (e.g. i} 32 P or 
fluorescent label) prior to hybridization to the arrays. Once hybridization occurs, the 
arrays are inserted into a scanner which can detect patterns of hybridization. The 
hybridization data are collected as light emitted from the labeled groups which is 
1 5 now bound to the probe array. The probes that perfectly match the target produce a 
stronger signal than those that have mismatches. Since the sequence and position of 
each probe on the array are known, by complementarity, the identity of the target 
nucleic acid applied to the probe is determined. Quantitation of gene profiles from 
the hybridization of labeled mRNA /DNA microarray can be performed by scanning 
the microarrays to measure the amount of hybridization at each position on the 
microarray with an Affymetrix scanner (Affymetrix, Santa Clara, CA ). For each 
stimulus a time series of mRNA levels (C={Cl,C2,C3,...Cn}) and a corresponding 
time series of mRNA levels (M={Ml,M2,M3,...Mn» in control medium in the same 
experiment as the stimulus is obtained. Quantitative data is then analyzed. Ci and 
Mi are defined as relative steady-state mRNA levels, where i refers to the ith 
timepoint and n to the total number of timepoints of the entire timecourse. [iM. and 
aM are defined as the mean and standard deviation of the control time course, 
respectively. Microarrays are only one method of obtaining gene expression values. 
Other methods for obtaining gene expression values known in the art or developed 
in the future can be used with the present invention. 
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Once the gene expression values are prepared, the sample can be classified. 
Genes that are particularly relevant for classification have been identified as a result 
of work described herein and are shown in FIGs. 1 A, IB, 2A-2F, 3A-3D, and 5, and 
Tables 1 and 2. The genes that are relevant for classification are referred to herein as 
5 "informative genes." Not all informative genes for a particular class distinction 
must be assessed in order to classify a sample. For example, a subset of the 
informative genes which demonstrate a high correlation with a class distinction can 
be used. This subset can be, for example, one or more genes, for example 2, 3, or 4 
genes, 5 or more genes, for example 6, 7, 8, or 9 genes, 10 or more genes, 25 or 
1 0 more genes, 45 or more genes, or 50 or more genes. Typically the accuracy of the 
classification will increase with the number of informative genes assessed. 

The correlation between gene expression profiles and class distinction can be 
determined using a variety of methods. Methods of defining classes and classifying" 
samples are described, for example, in U.S. Patent Application Serial No. 
1 5 09/544,627, filed April 6, 2000 by Golub et al., the teachings of which are 

incorporated herein by reference in their entirety. The information provided by the 
present invention, alone or in conjunction with other test results, aids in sample 
classification and diagnosis of disease. 

The present invention also provides methods for monitoring the effect of a 
20 treatment regimen in an individual by monitoring the gene expression profile for one 
or more informative genes. Treatment efficacy classification can be made by 
comparing the gene expression profile of a sample at several time points during 
treatment with respect to one or more informative genes. A treatment can be 
considered efficacious if the gene expression profile with regard to one or more 
15 informative genes tends toward a normal gene expression profile. That is, for 
example, treatment can be considered efficacious if a gene having increased 
expression in a disorder (e.g., MIX) shows reduced expression (i.e., expression 
tending toward normal expression) as a result of treatment. For example, in one 
method, a baseline gene expression profile for the individual can be determined, and 
0 repeated gene expression profiles can be determined at time points during treatment 
A shift in gene expression profile from a profile correlated with poor treatment 

i 
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outcome to profile correlated with improved treatment outcome is evidence of an 
effective therapeutic regimen, while a repeated profile correlated with poor treatment 
outcome is evidence of an ineffective therapeutic regimeri. For example, HOXA9 
and MEIS1 upregulation has been correlated with a poor prognosis. An effective 
5 therapeutic regimen might be expected to reduce the level of HOXA9 and MEIS1 
expression. Similarly, as described herein, expression of FLT3 is correlated with 
MLL. Thus, a reduction in the baseline level of FLT3 or its kinase activity can be 
indicative of an effective therapeutic. FIGs. 1A, IB, 2A-2F, 3A-3D, and 5, and 
Tables 1 and 2 provide additional gene products which can be useful in evaluating 

10 the efficacy of treatment. 

The present invention also provides information regarding the genes that are 
important in MLL treatment response, thereby providing additional targets for 
diagnosis and therapy. It is also clearthat the present invention can be used to 
generate databases comprising informative genes which will have many applications 

15 in medicine, research and industry. 

Also encompassed in the present invention is the use of gene expression 
profiles to screen for therapeutic agents. In one embodiment, the present invention 
is directed to a method of screening for a therapeutic agent for an individual with 
mixed lineage leukemia, comprising isolating a gene expression product from at 

20 least one informative gene from one or more cells of the individual with mixed 
lineage leukemia; identifying a therapeutic agent by determining a gene expression 
profile of at least one informative gene before and after administration of the agent, 
wherein if the gene expression profile from the individual after administration of the 
agent is correlated with effective treatment of mixed lineage leukemia the agent is 

25 identified as a therapeutic agent. In another embodiment, the cells are selected from 
the group consisting of mononuclear blood cells and bone marrow cells. 
Alternatively, the above method can utilize a cell line derived from an individual 
with mixed lineage leukemia. 

The invention also provides methods (also referred to herein as "screening 

30 assays 11 ) for identifying agents or compounds (e.g., fusion proteins, polypeptides, 
peptidomimetics, prodrugs, receptors, binding agents, antibodies, small molecules or 
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other drugs, or ribozymes) that alter or modulate (e.g., increase or decrease) the 
activity of the gene expression products of the informative genes (e.g., polypeptides 
encoded by the informative genes) as described herein, or that otherwise interact 
with the informative genes and/or polypeptides described herein. Such compounds 
5 can be compounds or agents that bind to informative gene expression products 
described herein (e.g., the polypeptides encoded by the informative genes in FIGs. 
1 A, IB, 2A-2F, 3A-3D, and 5, and Tables 1 and 2), and that have a stimulatory or 
inhibitory effect on, for example, activity of the polypeptide encoded by an 
informative gene described herein; or that change (e.g., enhance or inhibit) the 
1 0 ability of a polypeptide encoded by an informative gene to interact with compounds 
or agents that bind such an informative gene polypeptide; or that alter 
post-translational processing of such a polypeptide (e.g., agents that alter proteolytic 
processing to direct the polypeptide from where it is normally synthesized to another 
location in the cell, such as the cell surface or the nucleus; or agents that alter 
15 proteolytic processing such that more polypeptide is released from the cell, etc.). In 
one example, the binding agent is an MIX binding agent. As used herein, by an 
MLL binding agent" is meant an agent as described herein that binds to a 
polypeptide encoded by an informative gene of the present invention and modulates 
the occurrence, severity, or progression of mixed lineage leukemia. The modulation 
20 can be an increase or a decrease in the occurrence, severity, or progression of 

prostate cancer. In addition, an MLL binding agent includes an agent that binds to a 
polypeptide that is upstream (earlier) or downstream (later) of the cell signaling 
events mediated by a polypeptide encoded by an informative gene of the present 
invention, and thereby modulates the overall activity of the signaling pathway; in 
25 turn, the mixed lineage leukemia disease state of is modulated. 

The candidate compound can cause an alteration in the activity of a 
polypeptide encoded by an informative gene of the present invention. For example, 
the activity of the polypeptide can be altered (increased or decreased) by at least 
1.5-fold to 2-fold, at least 3-fold, or, at least 5-fold, relative to the control. 
30 Alternatively, the polypeptide activity can be altered, for example, by at least 1 0%, 
at least 20%, 40%, 50%, or 75%, or by at least 90%, relative to the control. 
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In one embodiment, the invention provides assays for screening candidate 
compounds or test agents to identify compounds that bind to or modulate the activity 
of a polypeptide encoded by an informative gene described herein (or biologically 
active portion(s) thereof), as well as agents identifiable by the assays. As used 
5 herein, a "candidate compound" or "test agent" is a chemical molecule, be it 
naturally-occurring or artificially-derived, and includes, for example, peptides, 
proteins, synthesized molecules, for example, synthetic organic molecules, 
naturally-occurring molecule, for example, naturally occurring organic molecules, 
nucleic acid molecules, and components thereof. 
10 In general, candidate compounds for use in the present invention may be 

identified from large libraries of natural products or synthetic (or semi-synthetic) 
extracts or chemical libraries according to methods known in the art. Those skilled 
in the field of drug discovery and development will understand that the precise 
source of test extracts or compounds is not critical to the screening procedure(s) of 
15 the invention. Accordingly, virtually any number of chemical extracts or 
compounds can be screened using the exemplary methods described herein. 
Examples of such extracts or compounds include, but are not limited to, plant-, 
fungal-, prokaryotic- or animal-based extracts, fermentation broths, and synthetic 
compounds, as well as modification of existing compounds. Numerous methods are 
20 also available for generating random or directed synthesis (e.g., semi-synthesis or 
total synthesis) of any number of chemical compounds, including, but not limited to, 
saccharide-, lipid-, peptide-, and nucleic acid-based compounds. Synthetic 
compound libraries are commercially available, e.g., from Brandon Associates 
(Merrimack, NH) and Aldrich Chemical (Milwaukee, WI). Alternatively, libraries 
25 of natural compounds in the form of bacterial, fungal, plant, and animal extracts are 
commercially available from a number of sources, including Biotics (Sussex, UK), 
Xenova (Slough, UK), Harbor Branch Oceangraphics Institute (Ft. Pierce, FL), and 
PharmaMar, U.S.A. (Cambridge, MA). In addition, natural and synthetically 
produced libraries are generated, if desired, according to methods known in the art, 
30 e.g., by standard extraction and fractionation methods. For example, candidate 

compounds can be obtained using any of the numerous approaches in combinatorial 
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library methods known in the art, including: biological libraries; spatially 
addressable parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; the "one-bead one-compound" library method; and 
synthetic library methods using affinity chromatography selection. The biological 
library approach is limited to polypeptide libraries, while the other four approaches 
are applicable to polypeptide, non-peptide oligomer or small molecule libraries of 
compounds (Lam, Anticancer Drug Des., 12: 145 (1997)). Furthermore, if desired, 
any library or compound is readily modified using standard chemical, physical, or 
biochemical methods. 

In addition, those skilled in the art of drug discovery and development 
readily understand that methods for dereplication (e.g., taxonomic dereplication, 
biological dereplication, and chemical dereplication, or any combination thereof) or 
the elimination of replicates or repeats of materials already known for their activities 
should be employed whenever possible. 

When a crude extract is found to modulate (i.e., stimulate or inhibit) the 
expression and/or activity of the informative genes and/or their encoded 
polypeptides, further fractionation of the positive lead extract is necessary to isolate 
chemical constituents responsible for the observed effect. Thus, the goal of the 
extraction, fractionation, and purification process is the careful characterization and 
identification of a chemical entity within the crude extract having an activity that 
stimulates or inhibits-nucleic acid expression, polypeptide expression, or polypeptide 
biological activity. The same assays described herein for the detection of activities 
in mixtures of compounds can be used to purify the active component and to test 
derivatives thereof. Methods of fractionation and purification of such heterogenous 
extracts are known in the art. If desired, compounds shown to be useful agents for 
treatment are chemically modified according to methods known in the art. 
Compounds identified as being of therapeutic value may be subsequently analyzed 
using animal models for diseases in which it is desirable to alter the activity or 
expression of the nucleic acids or polypeptides of the present invention. 

In one embodiment, to identify candidate compounds that alter the biological 
activity of a polypeptide encoded by an informative gene as described herein, a cell, 
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tissue, cell lysate, tissue lysate, or solution containing or expressing a polypeptide 
encoded by the informative gene (e.g., a polypeptide encoded by a gene in any of 
FIGs. 1 A, IB, 2A-2F, 3A-3D, and 5, and Tables 1 and 2), or a fragment or derivative 
thereof can be contacted with a candidate compound to be tested under conditions 
5 suitable for biological activity of the polypeptide. Alternatively, the polypeptide can 
be contacted directly with the candidate compound to be tested. The level (amount) 
of polypeptide biological activity is assessed/measured, either directly or indirectly, 
and is compared with the level of biological activity in a control (i.e., the level of 
activity of the polypeptide or active fragment or derivative thereof in the absence of 
10 the candidate compound to be tested, or in the presence of the candidate compound 
vehicle only). If the level of the biological activity in the presence of the candidate 
compound differs, by an amount that is statistically significant, from the level of the 
biological activity in the absence of the candidate compound, or in the presence of 
the candidate compound vehicle only, then the candidate compound is a compound 
15 that alters the biological activity of the polypeptide encoded by an informative gene 
of the invention. For example, an increase in the level of polypeptide biological 
activity relative to a control, indicates that the candidate compound is a compound 
that enhances (is an agonist of) the polypeptide biological activity. Similarly, a 
decrease in the polypeptide biological activity relative to a control, indicates that the 
20 candidate compound is a compound that inhibits (is an antagonist of) the 
polypeptide biological activity. 

In another embodiment, the level of biological activity of a polypeptide 
encoded by an informative gene, or a derivative or fragment thereof in the presence 
of the candidate compound to be tested, is compared with a control level that has 
25 previously been established. A level of polypeptide biological activity in the 

presence of the candidate compound that differs from (i.e., increases or decreases) 
the control level by an amount that is statistically significant indicates that the 
compound alters the biological activity of the polypeptide. 

The present invention also relates to an assay for identifying compounds 
30 (e.g., antisense nucleic acids, fusion proteins, polypeptides, peptidomimetics, 

prodrugs, receptors, binding agents, antibodies, small molecules or other drugs, or 
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ribozymes) that alter (e.g., increase or decrease) expression (e.g., transcription or 
translation) of an informative gene or that otherwise interact with an informative 
gene described herein, as well as compounds identifiable by the assays. For 
example, a solution containing an informative gene can be contacted with a 
5 candidate compound to be tested. The solution can comprise, for example, cells 
containing the informative gene or cell lysate containing the informative gene; 
alternatively, the solution can be another solution that comprises elements necessary 
for transcription/translation of the informative gene. Cells not suspended in solution 
can also be employed, if desired. The level and/or pattern of informative gene 
1 0 expression (e.g., the level and/or pattern ofmRNA or protein expressed) is assessed, 
and is compared with the level and/or pattern of expression in a control (i.e., the 
level and/or pattern of the informative gene expressed in the absence of the 
candidate compound, or in the presence of the candidate compound vehicle only). If 
the expression level and/or pattern in the presence of the candidate compound differs 
15 by an amount or in a manner that is statistically significant from the level and/or 
pattern in the absence of the candidate compound, or in the presence of the candidate 
compound vehicle only, then the candidate compound is a compound that alters the 
expression of an informative gene. Enhancement of informative gene expression 
indicates that the candidate compound is an agonist of informative gene polypeptide 
20 activity. Similarly, inhibition of informative gene expression indicates that the 
candidate compound is an antagonist of informative gene polypeptide activity. 

In another embodiment, the level and/or pattern of an informative gene in the 
presence of the candidate compound to be tested, is compared with a control level 
and/or pattern that has previously been established. A level and/or pattern 
25 informative gene expression in the presence of the candidate compound that differs 
from the control level and/or pattern by an amount or in a manner that is statistically 
significant indicates that the candidate compound alters informative gene 
expression. 

In another embodiment of the invention, compounds that alter the expression 
30 of an informative gene, or that otherwise interact with an informative gene described 
herein, can be identified using a cell, cell lysate, or solution containing a nucleic acid 
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encoding the promoter region of the informative gene operably linked to a reporter 
gene. As used herein by "promoter" means a minimal nucleotide sequence sufficient 
to direct transcription, and by "operably linked" means that a gene and one or more 
regulatory sequences are connected in such a way as to permit gene expression when 
5 the appropriate molecules (e.g., transcriptional activator proteins) are bound to the 
regulatory sequences. Examples of reporter genes and methods for operably linking 
a reporter gene to a promoter are known in the art. After contact with a candidate 
compound to be tested, the level of expression of the reporter gene (e.g., the level of 
mRNA or of protein expressed) is assessed, and is compared with the level of 
expression in a control (i.e., the level of expression of the reporter gene in the 
absence of the candidate compound, or in the presence of the candidate compound 
vehicle only). If the level of expression in the presence of the candidate compound 
differs by an amount-or in a manner that is statistically significant from the level in 
the absence of the candidate compound, or in the presence of the candidate 
compound vehicle only, then the candidate compound is a compound that alters the 
expression of the informative gene, as indicated by its ability to alter expression of 
the reporter gene that is operably linked to the informative gene promoter. 
Enhancement of the expression of the reporter gene indicates that the compound is 
an agonist of the informative gene polypeptide activity. Similarly, inhibition of the 
expression of the reporter gene indicates that the compound is an antagonist of the 
informative gene polypeptide activity. 

In another embodiment, the level of expression of the reporter in the 
presence of the candidate compound to be tested, is compared with a control level 
that has been established previously. A level in the presence of the candidate 
compound that differs from the control level by an amount or in a manner that is 
statistically significant indicates that the candidate compound alters informative 
gene expression. 

The present invention also features methods of detecting and/or identifying a 
compound that alters the interaction between a polypeptide encoded by an 
informative gene and a polypeptide (or other molecule) with which the polypeptide 
normally interacts with (e.g., in a cell or under physiological conditions). In one 
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example, a cell or tissue that expresses or contains a compound (e.g., a polypeptide 
or other molecule) that interacts with a polypeptide encoded by an informative gene 
(such a molecule is referred to herein as a "polypeptide substrate") is contacted with 
the informative gene polypeptide in the presence of a candidate compound, and the 
5 ability of the candidate compound to alter the interaction between the polypeptide 
encoded by the informative gene and the polypeptide substrate is detennined, for 
example, by assaying activity of the polypeptide. Alternatively, a cell lysate or a 
solution containing the informative gene polypeptide, the polypeptide substrate, and 
the candidate compound can be used. A compound that binds to the informative 
1 0 gene polypeptide or to the polypeptide substrate can alter the interaction between the 
informative gene polypeptide and the polypeptide substrate by interfering with 
(inhibiting), or enhancing the ability of the informative gene polypeptide to bind to, 
associate with, or otherwise interact with the polypeptide substrate. 

Determining the ability of the candidate compound to bind to the informative 
15 gene polypeptide or a polypeptide substrate can be accomplished, for example, by 
coupling the candidate compound with a radioisotope or enzymatic label such that 
binding of the candidate compound to the informative gene polypeptide or 
polypeptide substrate can be detennined by directly or indirectly detecting the 
candidate compound labeled with 125 I, 35 S, 14 C, or 3 H, and the detecting the 
20 radioisotope (e.g., by direct counting of radioemmission or by scintillation 

counting). Alternatively, the candidate compound can be enzymatically labeled 
with, for example, horseradish peroxidase, alkaline phosphatase, or luciferase, and 
the enzymatic label is then detected by determination of conversion of an 
appropriate substrate to product. In another alternative, one of the other components 
25 of the screening assay (e.g., the polypeptide substrate or the informative gene 
polypeptide) can be labeled, and alterations in the interaction between the 
informative gene polypeptide and the polypeptide substrate can be detected, ha these 
methods, labeled unbound components can be removed (e.g., by washing) after the 
interaction step in order to accurately detect the effect of the candidate compound on 
30 the interaction between the informative gene polypeptide and the polypeptide 
substrate. 
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It is also within the scope of this invention to determine the ability of a 
candidate compound to interact with the informative gene polypeptide or 
polypeptide substrate without the labeling of any of the interactants. For example, a 
microphysiometer can be used to detect the interaction of a candidate compound 
5 with a polypeptide encoded by an informative gene or a polypeptide substrate 
without the labeling of either the candidate compound, the polypeptide encoded by 
the informative gene, or the polypeptide substrate (McConnell et al., Science 257: 
1906-1912 (1992)). As used herein, a "microphysiometer" (e.g., CYTOSENSOR™) 
is an analytical instrument that measures the rate at which a cell acidifies its 
1 0 environment using a light-addressable potentiometric sensor (LAPS). Changes in 
this acidification rate can be used as an indicator of the interaction between ligand 
and polypeptide. 

In another embodiment of the invention, assays can be used-to identify 
polypeptides that interact with one or more polypeptides encoded by an informative 

1 5 gene. For example, a yeast two-hybrid system such as that described by Fields and 
Song (Fields and Song, Nature 340: 245-246 (1989)) can be used to identify 
polypeptides that interact with one or more polypeptides encoded by an informative 
gene. In such a yeast two-hybrid system, vectors are constructed based on the 
flexibility of a transcription factor that has two functional domains (a DNA binding 

20 domain and a transcription activation domain). If the two domains are separated but 
fused to two different proteins that interact with one another, transcriptional 
activation can be achieved, and transcription of specific markers (e.g., nutritional 
markers such as His and Ade, or color markers such as lacZ) can be used to identify 
the presence of interaction and transcriptional activation. For example, in the 

25 methods of the invention, a first vector is used that includes a nucleic acid encoding 
a DNA binding domain and a polypeptide encoded by an informative gene, or 
fragment or derivative thereof, and a second vector is used that includes a nucleic 
acid encoding a transcription activation domain and a nucleic acid encoding a 
polypeptide that potentially may interact with the informative gene polypeptide, or 

30 fragment or derivative thereof. Incubation of yeast containing the first vector and 
the second vector under appropriate conditions (e.g., mating conditions such as used 
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in the MATCHMAKER™ system from Clontech) allows identification of colonies 
that express the markers of the polypeptide^). These colonies can be examined to 
identify the polypeptide(s) that interact with the polypeptide encoded by the 
informative gene or a fragment or derivative thereof. Such polypeptides may be 
5 useful as compounds that alter the activity or expression of an informative gene 
polypeptide. 

In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize a polypeptide encoded by an 
informative gene, or a polypeptide substrate, or other components of the assay on a 

1 0 solid support, in order to facilitate separation of complexed from uncomplexed 
forms of one or both of the polypeptides, as well as to accommodate automation of 
the assay. Binding of a candidate compound to the polypeptide, or interaction of the 
polypeptide with a polypeptide substrate in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for containing the reactants. 

1 5 Examples of such vessels include microtitre plates, test tubes, and micro-centrifuge 
tubes. In one embodiment, a fusion protein (e.g., a glutathione-S-transferase fusion 
protein) can be provided that adds a domain that allows the informative gene 
polypeptide, or the polypeptide substrate to be bound to a matrix or other solid 
support. 

20 This invention further pertains to novel compounds identified by the above- 

described screening assays. Accordingly, it is within the scope of this invention to 
further use a compound identified as described herein in an appropriate animal 
model. For example, a compound identified as described herein can be used in an 
animal model to determine the efficacy, toxicity, or side effects of treatment with 

25 such a compound. Alternatively, a compound identified as described herein can be 
used in an animal model to determine the mechanism of action of such a compound. 
Furthermore, this invention pertains to uses of novel compounds identified by the 
above-described screening assays for treatments as described herein. In addition, a 
compound identified as described herein can be used to alter activity of a 

30 polypeptide encoded by an infonnative gene, or to alter expression of the 

informative gene, by contacting the polypeptide or the nucleic acid molecule (or 
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contacting a cell comprising the polypeptide or the nucleic acid molecule) with the 
compound identified as described herein. 

The present invention encompasses a method of treating MLL, AML or 
ALL, comprising the administration of an agent which modulates the expression 
5 level or activity of an informative gene product. A therapeutic agent may increase or 
decrease the level or activity of the gene product. For example, an inhibitor of the 
kinase FLT3 could be useful in treating MLL, Other suitable therapeutic targets for 
drug development include genes described herein in FIGs. 1 A, IB, 2A-2F, 3A-3D, 
and 5, and Tables 1 and 2. 

10 The present invention further relates to antibodies that specifically bind a 

polypeptide, preferably an epitope, of an informative gene of the present invention 
(as determined, for example, by immunoassays, a technique well known in the art 
for assaying specific antibody-antigen binding). Antibodies of the invention include, 
but are not limited to, polyclonal, monoclonal, multispecific, human, humanized or 

15 chimeric antibodies, single chain antibodies, Fab fragments, F(ab') fragments, 

fragments produced by a Fab expression library, anti-idiotypic (anti-Id) antibodies 
(including, for example, anti-Id antibodies to antibodies of the invention), and 
epitope-binding fragments of any of the above. 

The term "antibody," as used herein, refers to immunoglobulin molecules 

20 and immunologically active portions of immunoglobulin molecules, and more 

specifically, molecules that contain an antigen binding site that specifically binds an 
antigen. The immunoglobulin molecules of the invention can be of any type (for 
example, IgG, IgE, IgM, IgD, IgA and IgY), and of any class (for example, IgGl, 
IgG2, IgG3, IgG4, IgAl and IgA2) or subclass of an immunoglobulin molecule. 

25 In one embodiment, the antibodies are antigen-binding antibody fragments 

and include, without limitation, Fab, Fab 1 and F(ab') 2 , Fd, single-chain Fvs (scFv), 
single-chain antibodies, disulfide-linked Fvs (sdFv) and fragments comprising either 
a V L or V H domain. Antigen-binding antibody fragments, including single-chain 
antibodies, can comprise the variable region(s) alone or in combination with the 

30 entirety or a portion of one or more of the following: hinge region, CHI, CH2, and 
CH3 domains. Also included in the invention are antigen-binding fragments also 
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comprising any combination of variable region(s) with a hinge region, CHI, CH2, 
and/or CH3 domains. 

The antibodies of the invention may be from any animal origin including 
birds and mammals. Preferably, the antibodies are human, murine, donkey, sheep, 
5 rabbit, goat, guinea pig, hamster, horse, or chicken. 

As used herein, "human" antibodies include antibodies having the amino acid 
sequence of a human immunoglobulin and include antibodies produced by human B 
cells, or isolated from human sera, human immunoglobulin libraries or from animals 
transgenic for one or more human immunoglobulins and that do not express 
1 0 endogenous immunoglobulins, as described in U.S. Pat. No. 5,939,598 by 
Kucherlapati et al, for example. 

The antibodies of the present invention may be monospecific, bispecific, 
trispecific or of greater multispecificity. Multispecific antibodies may be specific 
for different epitopes of a polypeptide of the present invention or may be specific for 
1 5 both a polypeptide of the present invention as well as for a heterologous epitope, 
such as a heterologous polypeptide or solid support material. 

Antibodies of the present invention may be described or specified in terms of 
the epitope(s) or portion(s) of a polypeptide of the present invention that they 
recognize or specifically bind. The epitope(s) or polypeptide portions) may be 
20 specified, for example, by N-terminal and/or C-terminal positions, or by size in 
contiguous amino acid residues. Antibodies that specifically bind any epitope or 
polypeptide encoded by an informative gene of the present invention may also be 
excluded. Therefore, the present invention includes antibodies that specifically bind 
a polypeptide encoded by an informative gene of the present invention, and allows 
25 for the exclusion of the same. 

The term "epitope," as used herein, refers to a portion of a polypeptide which 
contacts an antigen-binding site(s) of an antibody or T cell receptor. Specific 
binding of an antibody to an antigen having one or more epitopes excludes non- 
specific binding to unrelated antigens, but does not necessarily exclude cross- 
3 0 reactivity with other antigens with similar epitopes. 
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Antibodies of the present invention may also be described or specified in 
terms of their cross-reactivity. Antibodies of the present invention may not display 
any cross-reactivity, such that they do not bind any other analog, ortholog, or 
homolog of a polypeptide of the present invention. Alternatively, antibodies of the 
5 invention can bind polypeptides with at least about 95%, 90%, 85%, 80%, 75%, 
70%, 65%, 60%, 55%, or 50% identity (as calculated using methods known in the 
art) to a polypeptide encoded by an informative gene of the present invention. 
Further included in the present invention are antibodies that bind polypeptides 
encoded by informative genes that hybridize to an informative gene of the present 
1 0 invention under stringent hybridization conditions, as will be appreciated by one of 
skill in the art. 

Antibodies of the present invention can also be described or specified in 
terms of their binding affinity to a polypeptide of the invention. Preferred binding 
affinities include those with a dissociation constant or Kd less than 5x10* M, 1 0" 6 M, 
15 5xl0- 7 M, 10- 7 M, 5xl0" 8 M, 10" 8 M, 5xl0' 9 M, 10~ 9 M, 5xl0- 10 M, 10' 10 M, 5X10' 11 M, 
10- u M, 5xl0" 12 M, 10~ 12 M, 5xl0' 13 M, 10" 13 M, 5xlO" 14 M, 10" 13 M, 5xl0" 15 M, and 
10' 15 M. 

The invention also provides antibodies that competitively inhibit binding of 
an antibody to an epitope of a polypeptide of the invention, as determined by any 
20 method known in the art for determining competitive binding, for example, using 
- immunoassays. In particular embodiments, the antibody competitively inhibits 
binding to the epitope by at least about 90%, 80%, 70%, 60%, or 50%. 

Antibodies of the present invention can act as agonists or antagonists of 
polypeptides encoded by the informative genes of the present invention. For 
25 example, the present invention includes antibodies which disrupt interactions with 
the polypeptides encoded by the informative genes of the invention either partially or 
fully. The invention also includes antibodies that do not prevent binding, but 
prevent activation or activity of the polypeptide. Activation or activity (for example, 
signaling) may be determined by techniques known in the art. Also included are 
30 antibodies that prevent both binding to and activity of a polypeptide encoded by an 
informative gene. Likewise included are neutralizing antibodies. 
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Antibodies of the present invention may be used, for example, and without 
limitation, to purify, detect, and target the polypeptides encoded by the informative 
genes described herein, including both in vitro and in vivo diagnostic and 
therapeutic methods. For example, the antibodies have use in immunoassays for 
5 qualitatively and quantitatively measuring levels of the polypeptides in biological 
samples. See, for example, Harlow et al, Antibodies: A Laboratory Manual (Cold 
Spring Harbor Laboratory Press, 2nd ed. 1988). 

As discussed in more detail below, the antibodies of the present invention 
may be used either alone or in combination with other compositions. The antibodies 
1 0 may further be recombinantly fused to a heterologous polypeptide at the N- and/or 
C-terminus or chemically conjugated (including covalent and non-covalent 
conjugations) to polypeptides or other compositions. For example, antibodies of the 
present invention may be recombinantly fused or conjugated to molecules useful as 
labels in detection assays, or effector molecules such as heterologous polypeptides, 
15 drugs, or toxins. 

The antibodies of the invention include derivatives that are modified, for 
example, by the covalent attachment of any type of molecule to the antibody such 
that covalent attachment does not prevent the antibody from recognizing its epitope. 
For example, but not by way of hnutation, the antibody derivatives include 

20 antibodies that have been modified, for example, by glycosylation, acetylation, 

pegylation, phosphorylation, amidation, derivatization by known protectmg/blocking 
groups, proteolytic cleavage, or linkage to a cellular ligand or other protein. Any of 
numerous chemical modifications can be carried out by known techniques, 
including, but not limited to, specific chemical cleavage, acetylation, formylation, 

>5 and metabolic synthesis of tunicamycin. Additionally, the derivative can contain 
one or more non-classical amino acids. 

The antibodies of the present invention can be generated by any suitable 
method known in the art Polyclonal antibodies to an antigen-of-interest can be 
produced by various procedures well known in the art. For example, a polypeptide 

0 of the invention can be adfriinistered to various host animals including, but not 

limited to, rabbits, mice, rats, or the like, to induce the production of sera containing 
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polyclonal antibodies specific for the antigen. Various adjuvants can be used to 
increase the immunological response, depending on the host species, and include, 
but are not limited to, Freund's adjuvant (complete and incomplete), mineral gels 
such as aluminum hydroxide, surface active substances such as lysolecithin, pluronic 
5 polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanins, 
dinitrophenol, and potentially useful human adjuvants such as BCG (Bacille 
Calmette-Guerin) and corynebacterium parvum. Such adjuvants are well known in 
the art. 

Monoclonal antibodies can be prepared using a wide variety of techniques 

1 0 also known in the art, including hybridoma cell culture, recombinant, and phage 

display technologies, or a combination thereof. For example, monoclonal antibodies 
can be produced using hybridoina techniques as is known in the art and taught, for 
example, in Harlow et al., Antibodies: A Laboratory-Manual (Cold Spring Harbor 
Laboratory Press, 2nd ed. 1988). The term "monoclonal antibody" as used herein is 

1 5 not necessarily limited to antibodies produced through hybridoma technology, but 
also refers to an antibody that is derived from a single clone, including any 
eukaryotic, prokaryotic, or phage clone. 

Human antibodies are desirable for therapeutic treatment of human patients. 
These antibodies can be made by a variety of methods known in the art including 

20 phage display methods using antibody libraries derived from human 

immunoglobulin sequences. Human antibodies can also be produced using 
transgenic mice that are incapable of expressing functional endogenous 
immunoglobulins, but which can express human immunoglobulin genes. The 
transgenic mice are immunized with a selected antigen, for example, all or a portion 

25 of a polypeptide of the invention. Monoclonal antibodies directed against the antigen 
can be obtained from the immunized, transgenic mice using conventional hybridoma 
technology. The human immunoglobulin transgenes harbored by the transgenic 
mice rearrange during B cell differentiation, and subsequently undergo class 
switching and somatic mutation. Thus, using such a technique, it is possible to 

30 produce therapeutically useful IgG, IgA, IgM and IgE antibodies. For a detailed 
discussion of this technology for producing human antibodies and human 
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monoclonal antibodies and protocols for producing such antibodies, see, for 
example, PCT publications WO 98/24893; WO 96/34096; WO 96/33735; and U.S. 
Patent Nos. 5,413,923; 5,625,126; 5,633,425; 5,569,825; 5,661,016; 5,545,806; 
5,814,318; and 5,939,598. 
5 In another embodiment, antibodies to the polypeptides encoded by the 

informative genes as described herein can, in turn, be utilized to generate anti- 
idiotype antibodies that "mimic" polypeptides of the invention using techniques well 
known to those skilled in the art. (See, for example, Greenspan & Bona, FASEB J. 
7(5):437-444 (1989) and Nissinoff, J. Immunol. 147(8):2429-2438 (1991)). For 
1 0 example, antibodies that bind to and competitively inhibit polypeptide 

multimerization and/or binding of a polypeptide to a ligand can be used to generate 
anti-idiotypes that "nnmic" the polypeptide multimerization and/or binding domain 
and, as a consequence, bind to and neutralize polypeptide and/or its ligand. Such 
neutralizing anti-idiotypes or Fab fragments of such anti-idiotypes can be used in 
15 therapeutic regimens to neutralize polypeptide ligand. For example, such anti- 
idiotype antibodies can be used to bind a polypeptide encoded by an informative 
gene and/or to bind its ligands, and thereby block its biological activity. 

The antibodies or fragments thereof of the present invention can be fused to 
marker sequences, such as a peptide to facilitate their purification. In one 
20 embodiment, the marker amino acid sequence is a hexa-histidine peptide, an HA tag, 
or a FLAG tag, as will be readily appreciated by one of skill in the art. 

The present invention further encompasses antibodies or fragments thereof 
conjugated to a diagnostic or therapeutic agent. The antibodies can be used 
diagnostically, for example, to monitor the development or progression of a tumor as 
25 part of a clinical testing procedure to determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling the antibody to a detectable 
substance. Examples of detectable substances include enzymes (such as, horseradish 
peroxidase, alkaline phosphatase, beta-galactosidase, or acetylcholinesterase), 
prosthetic group (such as streptavidin/biotin and avidin/biotin), fluorescent materials 
30 (such as umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 

dicMorotriazmylamine fluorescein, dansyl chloride orphycoerytbjin), luminescent 
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materials (such as luminol), bioluminescent materials (such as luciferase, luciferin, 
and aequorin), radioactive materials (such as, 125 1, 131 1, 111 la or 99 Tc), and positron 
emitting metals using various positron emission tomographies, and nonradioactive 
paramagnetic metal ions. 
5 In an additional embodiment, an antibody or fragment thereof can be 

conjugated to a therapeutic moiety such as a cytotoxin, for example, a cytostatic or 
cytocidal agent, a therapeutic agent or a radioactive metal ion. A cytotoxin or 
cytotoxic agent includes any agent that is detrimental to cells. Examples include 
paclitaxol, cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, 
10 etoposide, tenoposide, vincristine, vinblastine, colchicin, doxorubicin, daunorubicin, 
dihydroxy anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1- 
dehydrotestosterone, glucocorticoids, procaine, tetracaine, lidocaine, propranolol, 
and puromycin and analogs or homologs thereof. Therapeutic agents include, but 
are not limited to, antimetabolites (e.g., methotrexate, 6-mercaptopurine, 6- 
15 thioguanine, cytarabine, 5-fluorouracil decarbazine), alkylating agents (e.g., 
mechlorethamine, thioepa chlorambucil, melphalan, carmustine (BSNU) and 
lomustine (CCNU), cyclothospharnide, busulfan, dibromomannitol, streptozotocin, 
mitomycin C, and cis-dicMorodiamine platinum (IT) (DDP) cisplatin), anthracyclines 
(for example, daunorubicin (formerly daunomycin) and doxorubicin), antibiotics (for 
20 example, actinomycin, bleomycin, mithramycin, and anthramycin (AMC)), and anti- 
mitotic agents (for example, vincristine and vinblastine). 

The conjugates of the invention can be used for modifying a given biological 
response, the therapeutic agent or drug moiety is not to be construed as limited to 
classical chemical therapeutic agents. For example, the drug moiety may be a 
protein or polypeptide possessing a desired biological activity. Such proteins may 
include, for example, a toxin such as abrin, ricin A, pseudomonas exotoxin, or 
diphtheria toxin; a protein such as tumor necrosis factor, a-interferon, {3-interferon, 
nerve growth factor, platelet derived growth factor, tissue plasminogen activator, a 
thrombotic agent or an anti-angiogenic agent, for example, angiostatin or endostatin; 
or, biological response modifiers such as, for example, lymphokines, mterleukins, 
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granulocyte macrophase colony stimulating factor ("GM-CSF"), granulocyte colony 
stimulating factor ("G-CSF"), or other growth factors. 

Antibodies of the invention can also be attached to solid supports. These are 
particularly useful for immunoassays or purification of the target antigen. Such solid 
5 supports include, but are not limited to, glass, cellulose, silicon, polyacrylamide, 
nylon, polystyrene, polyvinyl chloride or polypropylene. Techniques for conjugating 
such therapeutic moiety to antibodies are well known in the art, see, for example, 
Anion et al., "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer 
Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. eds., pp. 
1 0 243-56 (Alan R. Liss, Inc. 1985). 

Alternatively, an antibody can be conjugated to a second antibody to form an 
antibody heteroconjugate as described by Segal in U.S. Pat. No. 4,676,980. 

An antibody of the invention, with or without conjugation to a therapeutic 
moiety, administered alone or in combination with cytotoxic factor(s) and/or 
15 cytokine(s), can be used as a therapeutic. 

Antisense antagonists of the present invention are also included. Antisense 
technology can be used to control gene expression through antisense DNA or RNA, 
or through triple-helix formation. Antisense techniques are discussed for example, 
in Okano, J., Neurochem. 56:560 (1991). The methods are based on binding of a 
20 polynucleotide to a complementary DNA or RNA. In one embodiment, an antisense 
sequence is generated internally by the organism, in another embodiment, the 
antisense sequence is separately administered (see, for example, O'Connor, J., 
Neurochem. 56:560 (1991)). 

In one embodiment, the 5' coding portion of an informative gene can be used 
25 to design an antisense RNA oligonucleotide from about 10 to 40 base pairs in length. 
Generally, a DNA oligonucleotide is designed to be complementary to a region of 
the gene involved in transcription thereby preventing transcription and the 
production of the receptor. The antisense RNA oligonucleotide hybridizes to the 
mRNA in vivo and blocks translation of the mRNA molecule into receptor 
30 polypeptide. 
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La one embodiment, the antisense nucleic acid of the invention is produced 
intracellularly by transcription from an exogenous sequence. For example, a vector 
or a portion thereof, is transcribed, producing an antisense nucleic acid of the 
invention. Such a vector contains the sequence encoding the antisense nucleic acid. 
5 The vector can remain episomal or become chromosomally integrated, as long as it 
can be transcribed to produce the desired antisense RNA. Vectors can be 
constructed by recombinant DNA technology and can be plasmid, viral, or 
otherwise, as is known to one of skill in the art. 

Expression can be controlled by any promoter known in the art to act in the 

10 target cells, such as vertebrate cells, and preferably human cells. Such promoters 
can be inducible or constitutive and include, without limitation, the SV40 early 
promoter region (Bernoist and Chambon, Nature 29:304-310(1981), the promoter 
contained in the 3 ! long terminal repeat of Rous sarcoma virus (Y amamoto et aL, 
Cell 22:787-797 (1980)), the herpes thymidine promoter (Wagner et aL, Proc. Natl. 

15 Acad. Sci. U.S.A. 78:1441-1445 (1981)), and the regulatory sequences of the 
metallothionein gene (Brinster et aL, Nature 296:39-42 (1982)). 

The antisense nucleic acids of the invention comprise a sequence 
complementary to at least a portion of an RNA transcript of an informative gene. 
Absolute complementarity, although preferred, is not required. A sequence 

20 "complementary to at least a portion of an RNA," referred to herein, means a 

sequence having sufficient complementarity to be able to hybridize with the RNA, 
forming a stable duplex. The ability to hybridize will depend on both the degree of 
complementarity and the length of the antisense nucleic acid. Generally, the larger 
the hybridizing nucleic acid, the more base mismatches with the RNA it may contain 

25 and still form a stable duplex. One skilled in the art can ascertain a tolerable degree 
of mismatch by use of standard procedures to determine the melting point of the 
hybridized complex. 

Oligonucleotides that are complementary to the 5 ! end of the RNA, for 
example, the 5' untranslated sequence up to and including the AUG initiation codon, 

30 are generally regarded to work most efficiently at inhibiting translation. However, 
sequences complementary to the 3' untranslated sequences of mRNAs have been 
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shown to be effective at inhibiting translation of mRNAs as well. Thus, 
oligonucleotides complementary to either the 5 - or 3'-non-translated, non-coding 
regions of a nucleotide sequence can be used in an antisense approach to inhibit 
mRNA translation. Oligonucleotides complementary to the 5 f untranslated region of 
5 the mRNA can include the complement of the AUG start codon. Antisense 
oligonucleotides complementary to mRNA coding regions can also be used in 
accordance with the invention. In one embodiment, the antisense nucleic acids are 
at least six nucleotides in length, and are preferably oligonucleotides ranging from 
about 6 to about 50 nucleotides in length. In other embodiments, the oligonucleotide 

10 is at least about 10, 17, 25 or 50 nucleotides in length. 

The antisense oligonucleotides of the invention can be DNA or RNA, or 
chimeric mixtures, or derivatives or modified versions thereof, single-stranded or 
double-stranded. The oligonucleotide can be modified at the base moiety, sugar 
moiety, or phosphate backbone, for example, to improve stability of the molecule, 

1 5 hybridization, and the like. The oligonucleotide can include other appended groups 
such as peptides (for example, to target host cell receptors in vivo), or agents that 
facilitate transport across the cell membrane, or the blood-brain barrier, or 
intercalating agents. 

The antisense oligonucleotide may comprise at least one modified base 

20 moiety which is selected from the group including, but not limited to, 5-fiuorouracil, 
5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4- 
acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2- 
thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, a-D- 
galactosylqueosine, inosine, N6-isopentenyladenine, l-methylguanine,T- 

25 methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3- 
methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 
mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 

30 queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5- 
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methyl-2-thiouracil, 3<3-aminoO-N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 
diaminopurine. 

The antisense oligonucleotide may also comprise at least one modified sugar 
moiety selected from the group including, but not limited to, arabinose, 2- 
5 fluoroarabinose, xylulose, and hexose. 

In yet another embodiment, the antisense bligonucleotide comprises at least 
one modified phosphate backbone selected from the group including, but not limited 
to, aphosphorothioate, aphosphorodithioate, aphosphoramidothioate, a 
phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl 
10 phosphotriester, and a formacetal or analog thereof. 

In yet another embodiment, the antisense oligonucleotide is an a-anomeric 
oligonucleotide. An a-anomeric oligonucleotide forms specific double-stranded 
hybrids with complementary RNA in which, contrary to the usual p-units, the 
strands run parallel to each other (Gautier et al., Nucl. Acids Res. 15:6625-6641 
15 (1987)). The oligonucleotide is a 2'-0-methylribonucleotide (Ihoue et al., Nucl. 
Acids Res. 15:6131-6148 (1987)), or a chimeric RNA-DNA analog (Inoue et al, 
FEBS Lett. 215:327-330 (1987)). 

Antisense oligonucleotides of the invention may be synthesized by standard 
methods known in the art, for example, by use of an automated DNA synthesizer. 
20 Potential antagonists according to the invention also include catalytic RNA, 

or a ribozyme. Hammerhead ribozymes cleave mRNAs at locations dictated by 
flanking regions that form complementary base pairs with the target mRNA. The 
target mRNA has the following sequence of two bases: 5*-UG-3\ The construction 
and production of hammerhead ribozymes is well known in the art and is described 
25 more fully in HaselofFand Gerlach (Nature 334:585-591 (1988)). Preferably, the 
ribozyme is engineered so that the cleavage recognition site is located near the 5 f end 
of the mRNA in order to increase efficiency and minimize the intracellular 
accumulation of non-functional mRNA transcripts. 

Ribozymes of the invention can be composed of modified oligonucleotides 
30 (for example for improved stability, targeting, and the like). DNA constructs 

encoding the ribozyme can be under the control of a strong constitutive promoter, 
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such as, for example, pol HI or pol II promoter, so that a transfected cell will produce 
sufficient quantities of the ribozyme to destroy endogenous target mRNA and inhibit 
translation. Since ribozymes, unlike antisense molecules, are catalytic, a lower 
intracellular concentration is generally required for efficiency. 
5 The present invention also provides pharmaceutical compositions, including 

both therapeutic and prophylatic compositions. Compositions within the scope of 
. this invention include all compositions wherein the therapeutic abent, antibody, 
fragment or derivative, antisense oligonucleotide or ribozyme is contained in an 
amount effective to achieve its intended purpose. While individual needs vary, 
1 0 determination of optimal ranges of effective amounts of each component is within 
the skill of the art. The effective dose is a function of a number of factors, including 
the specific antibody, the antisense construct, ribozyme or polypeptide of the 
invention, the presence of a conjugated therapeutic agent (see below), the patient and 
their clinical status. 

1 5 Mode of aa'ministration may be by parenteral, subcutaneous, intravenous, 

intramuscular, intraperitoneal, transdermal, or buccal routes. Alternatively, or 
concurrently, administration may be orally. The dosage administered will be 
dependent upon the age, health, and weight of the recipient, kind of concurrent 
treatment, if any, frequency of treatment, and the nature of the effect desired. 

20 Such compositions generally comprise a therapeutically effective amount of 

a compound, and a pharmaceutically acceptable carrier. In a specific embodiment, 
the term "pharmaceutically acceptable" means approved by a regulatory agency of 
the Federal or a state government or listed in the U.S. Pharmacopeia or other 
generally recognized pharmacopeia for use in animals, and more particularly in 

25 humans. The term "carrier" refers to a diluent, adjuvant, excipient, or vehicle with 
which the therapeutic is admnistered. Such pharmaceutical carriers can be sterile 
liquids, such as water and oils, including those of petroleum, animal, vegetable or 
synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil and the like. 
Water is a preferred carrier when the pharmaceutical composition is administered 

30 intravenously. Saline solutions and aqueous dextrose and glycerol solutions can also 
be employed as liquid carriers, particularly for injectable solutions. Suitable 



WO 03/008552 



-38- 



PCT/US02/22823 



pharmaceutical excipients include starch, glucose, lactose, sucrose, gelatin, malt, 
rice, flour, chalk, silica gel, sodium stearate, glycerol monostearate, talc, sodium 
chloride, dried skimmed milk, glycerol, propylene, glycol, water, ethanol and the 
like. The composition, if desired, can also contain minor amounts of wetting or 
5 emulsifying agents, or pH buffering agents. 

These compositions can take the form of solutions, suspensions, emulsion, 
tablets, pills, capsules, powders, sustained-release formulations and the like. The 
composition can be formulated as a suppository, with traditional binders and carriers 
such as triglycerides. Oral formulation can include standard carriers such as 

10 pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate, and the like. Such compositions will 
contain a therapeutically effective amount of the compound, preferably in purified 
form, together with a suitable amount of carrier so as to provide the form for proper - 
administration to the patient. The formulation should suit the mode of 

15 administration. 

In a preferred embodiment, the composition is formulated in accordance with 
routine procedures as a pharmaceutical composition adapted for intravenous 
administration to a human. Typically, compositions for intravenous administration 
are solutions in sterile isotonic aqueous buffer. Where necessary, the composition 

20 may also include a solubilizing agent and a local anesthetic such as lignocaine to 
ease pain at the site of the injection. Generally, the ingredients are supplied either 
separately or mixed together in unit dosage form, for example, as a dry lyophilized 
powder or water free concentrate in a hermetically sealed container such as an 
ampoule or sachette indicating the quantity of active agent. Where the composition 

25 is to be administered by infusion, it can be dispensed with an infusion bottle 

containing sterile pharmaceutical grade water or saline. Where the composition is 
administered by injection, an ampoule of sterile water for injection or saline can be 
provided so that the ingredients may be mixed prior to administration. 

The compounds of the invention can be formulated as neutral or salt forms. 

30 Pharmaceutically acceptable salts include those formed with anions such as those 
derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, and the like, and 
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those fonned with cations such as those derived from sodium, potassium, 
ammonium, calcium, feme hydroxides, isopropylamine, triemylamine, 2-emylamino 
ethanol, histidine, procaine, and the like. 

The compositions of the invention can be administered alone or in 
5 combination with other therapeutic agents. Therapeutic agents that can be 

administered in combination with the compositions of the invention, include but are 
not limited to chemotherapeutic agents, antibiotics, steroidal and non-steroidal anti- 
inflammatories, conventional immunotherapeutic agents, cytokines and/or growth 
factors. Combinations may be administered either concomitantly, for example, as an 
10 admixture, separately but simultaneously or concurrently, or sequentially. This 
includes presentations in which the combined agents are administered together as a 
therapeutic mixture, and also procedures in which the combined agents are 
administered separately but simultaneously, for example, as through separate 
intravenous lines into the same individual. Administration "in combination" further 
15 includes the separate administration of one of the compounds or agents given first, 
followed by the second. 

Conventional nonspecific immunosuppressive agents, that may be 
aciministered in combination with the compositions of the invention include, but are 
not limited to, steroids, cyclosporin, cyclosporin analogs, cyclophosphamide 
20 methylprednisone, prednisone, azathioprine, FX-506, 15-deoxyspergualin, and other 
immunosuppressive agents. 

hi a further embodiment, the compositions of the invention are administered 
in combination with an antibiotic agent. Antibiotic agents that maybe administered 
with the compositions of the invention include, but are not limited to, tetracycline, 
25 metronidazole, amoxi^ 

quinolones, fluoroquinolones, cephalosporins, erythromycin, ciprofloxacin, and 
streptomycin. 

In an additional embodiment, the compositions of the invention are 
administered alone or in combination with an anti-inflammatory agent Anti- 
30 inflammatory agents that can be administered with the compositions of the invention 
include, but are not limited to, glucocorticoids and the nonsteroidal anti- 
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inflammatories, aminoarylcarboxylic acid derivatives, arylacetic acid derivatives, 
arylbutyric acid derivatives, arylcarboxylic acids, arylpropionic acid derivatives, 
pyrazoles, pyrazolones, salicylic acid derivatives, thiazinecarboxamides, e- 
acetamidocaproic acid, S-adenosylmethiomne, 3-amino-4-hydroxybutyric acid, 
5 amixetrine, bendazac, benzydamine, bucolome, difenpiramide, ditazol, emorfazone, 
guaiazulene, nabumetone, mmesulide, orgotein, oxaceprol, paranyline, perisoxal, 
pifoxime, proquazone, proxazole, and tenidap. 

In another embodiment, compositions of the invention are administered in 
combination with a chemotherapeutic agent. Chemotherapeutic agents that may be 
1 0 administered with the compositions of the invention include, but are not limited to, 
antibiotic derivatives (e.g., doxorubicin, bleomycin, daunorubicin, and 
dactinomycin); antiestrogens (e.g., tamoxifen); antimetabolites (e.g., fluorouracil, 5- 
FU, methotrexate, floxuridine, interferon alpha-2b, glutamic acid, plicamycin, 
mercaptopurine, and 6-thioguanine); cytotoxic agents (e.g., carmustine, BCNU, 
1 5 lomustine, CCNU, cytosine arabinoside, cyclophosphamide, estramustine, 

hydroxyurea, procarbazine, mitomycin, busulfan, cis-platin, and vincristine sulfate); 
hormones (e.g., medroxyprogesterone, estramustine phosphate sodium, ethinyl 
estradiol, estradiol, megestrol acetate, methyltestosterone, diethylstilbestrol 
diphosphate, chlorotrianisene, and testolactone); nitrogen mustard derivatives (e.g., 
50 mephalen, chorambucil, mecWoremamine (nitrogen mustard) and thiotepa); steroids 
and combinations (e.g., bethamethasone sodium phosphate); and others (e.g., 
dicarbazine, asparaginase, mitotane, vincristine sulfate, vinblastine sulfate, and 
etoposide). 

In an additional embodiment, the compositions of the invention are 
5 administered in combination with cytokines. Cytokines that may be adniinistered 
with the compositions of the invention include, but are not limited to, IL2, IL3, TLA, 
TL5, JL6, IL7, IL10, IL12, IL13, IL15, anti-CD40, CD40L, IFN-gamma and TNF- 
alpha. 

In additional embodiments, the compositions of the invention are 
0 administered in combination with other therapeutic or prophylactic regimens, such 
as, for example, radiation therapy. 
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The present invention is further directed to therapies which involve 
admimstering pharmaceutical compositions of the invention to an animal, preferably 
a m a mm al, and most preferably a human patient for treating one or more of the 
described disorders. Therapeutic compositions of the invention include, for 
5 example, therapeutic agents identified in screening assays, antibodies of the 

invention (including fragments, analogs and derivatives thereof as described herein), 
antisense oligonucleotides, ribozymes and nucleic acids encoding same. The 
compositions of the invention can be used to treat, inhibit, prognose, diagnose or 
prevent diseases, disorders or conditions associated with aberrant expression and/or 
1 0 activity of a polypeptide of the invention, including, but not limited to, any one or 
more of the diseases, disorders, or conditions such as, for example, MIX, AML, or 
ALL. 

The treatment and/or prevention of diseases and disorders associated with - 
aberrant expression and/or activity of a polypeptide of the invention includes, but is 
1 5 not limited to, alleviating symptoms associated with those diseases and disorders. 

The amount of the compound of the invention which will be effective in the 
treatment, inhibition and prevention of a disease or disorder associated with aberrant 
expression and/or activity of a polypeptide of the invention can be determined by 
standard clinical techniques. In addition, in vitro assays may optionally be employed 
20 to help identify optimal dosage ranges. The precise dose to be employed in the 
formulation will also depend on the route of administration, and the seriousness of 
the disease or disorder. Effective doses may be extrapolated from dose-response 
curves derived from in vitro or animal model test systems. 

For antibodies, the dosage administered to a patient is typically 0. 1 mg/kg to 
25 100 mg/kg of the patient's body weight. Preferably, the dosage administered to a 
patient is between 0.1 mg/kg and 20 mg/kg of the patient's body weight, more 
preferably 1 mg/kg to 10 mg/kg of the patient's body weight. Generally, human 
antibodies have a longer half-life within the human body than antibodies from other 
species due to the immune response to the foreign polypeptides. Thus, lower 
3 0 dosages of human antibodies and less frequent acmiinistration is often possible. 
Furthermore, the dosage and frequency of administration of antibodies of the 
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invention may be reduced by enhancing uptake and tissue penetration of the 
antibodies by modifications such as, for example, lipidation or addition of cell- 
specific tags. 

The compounds or pharmaceutical compositions of the invention can be 
5 tested in vitro, and then in vivo for the desired therapeutic or prophylactic activity, 
prior to use in humans. For example, in vitro assays to demonstrate the therapeutic 
or prophylactic utility of a compound or pharmaceutical composition include, the 
effect of a compound on a cell line or a patient tissue sample. The effect of the 
compound or composition on the cell line and/or tissue sample can be determined 

10 utilizing techniques known to those of skill in the art including, but not limited to, 
rosette formation assays and cell lysis assays. In accordance with the invention, in 
vitro assays which can be used to determine whether administration of a specific 
compound is indicated, include in vitro cell culture assays in which a patient tissue 
sample is grown in culture, and exposed to or otherwise administered a compound, 

15 and the effect of such compound upon the tissue sample is observed. 

The invention provides methods of treatment, inhibition and prophylaxis by 
administration to a subject of an effective amount of a compound or pharmaceutical 
composition of the invention. In one aspect, the compound is substantially purified 
such that the compound is substantially free from substances that limit its effect or 

20 produce undesired side-effects. The subject is preferably an animal, including but 
not limited to animals such as cows, pigs, horses, chickens, cats, dogs, etc., and is 
preferably a mammal, and most preferably human. 

Various delivery systems are known and can be used to administer a 
composition of the invention, for example, encapsulation in liposomes, 

25 microparticles, microcapsules, recombinant cells capable of expressing the 

compound, receptor-mediated endocytosis (see, e.g., Wu and Wu, J. Biol. Chem. 
262:4429-4432 (1987)), construction of a nucleic acid as part of a retroviral or other 
vector, and the like as will be known by one of skill in the art. 

Methods of introduction include, but are not limited to, intradermal, 

30 intramuscular, intraperitoneal, intravenous, subcutaneous, intranasal, epidural, and 

oral routes. The compounds or compositions may be administered by any 

i 
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convenient route, for example by infusion or bolus injection, by absorption through 
epithelial or mucocutaneous linings (e.g., oral mucosa, rectal and intestinal mucosa, 
etc.) and may be a(lministered together with, other biologically active agents. 
Adnunistration can be systemic or local. In addition, it may be desirable to 
5 introduce the pharmaceutical compounds or compositions of the invention into the 
central nervous system by any suitable route, including intraventricular and 
intrathecal injection; intraventricular injection maybe facilitated by an 
intraventricular catheter, for example, attached to a reservoir, such as an Ommaya 
reservoir. Pulmonary administration can also be employed, for example, by use of 
1 0 an inhaler or nebulizer, and formulation with an aerosolizing agent. 

In a specific embodiment, it may be desirable to administer the 
pharmaceutical compounds or compositions of the invention locally to the area in 
need of treatment; this may be achieved by, for example, and not by way of 
limitation, local infusion during surgery, topical application, for example, in 
1 5 conjunction with a wound dressing after surgery, by injection, by means of a 

catheter, by means of a suppository, or by means of an implant, said implant being of 
a porous, non-porous, or gelatinous material, including membranes, such as sialastic 
membranes, or fibers. Preferably, when administering a protein, including an 
antibody, of the invention, care must be taken to use materials to which the protein 
20 does not absorb. 

In another embodiment, the compound or composition can be delivered in a 
vesicle, such as a liposome (Langer, Science 249:1527-1533 (1990)). 

In yet another embodiment, the compound or composition can be delivered 
in a controlled release system. Furthermore, a controlled release system can be 
25 placed in proximity of the therapeutic target, thus requiring only a fraction of the 
systemic dose (see, e.g., Goodson, in Medical Applications of Controlled Release, 
supra, vol. 2, pp. 1 15-138 (1984)). In a further embodiment, a pump may be used. 
In another embodiment, polymeric materials can be used. 

In a particular embodiment where the compound of the invention is a nucleic 
30 acid encoding a protein, the nucleic acid can be administered in vivo to promote 
expression of its mRNA and encoded protein, by constructing it as part of an 
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appropriate nucleic acid expression vector and administering, for example, by use of 
a retroviral vector, or by direct injection, or by use of microparticle bombardment for 
example, a gene gun, or coating with lipids or cell-surface receptors or transfecting 
agents, or by administering it in linkage to a homeobox-like peptide which is known 
5 to enter the nucleus (see e.g., Joliot et aL, Proc. Natl. Acad Sci, USA 88:1864-1868 
(1991)). Alternatively, a nucleic acid can be introduced intracellularly and 
incorporated within host cell DNA for expression, by homologous recombination. 

The present invention also provides kits that can be used in the above 
methods. In one embodiment, a kit comprises a pharmaceutical composition of the 
1 0 invention in one or more containers. 

In another embodiment, the kit is a diagnostic kit for use in testing biological 
samples. The kit can include a control antibody that does not react with the 
polypeptide of interest in addition to a specific antibody or antigen-binding fragment 
thereof which binds to the polypeptide (antigen) of the invention being tested for in 
1 5 the biological sample. Such a kit may include a substantially isolated polypeptide 
antigen comprising an epitope that is specifically immunoreactive with at least one 
anti-polypeptide antigen antibody. Further, such a kit can include a means for 
detecting the binding of said antibody to the antigen (for example, the antibody may 
be conjugated to a fluorescent compound such as fluorescein or rhodamine which 
20 can be detected by flow cytometry). In a further embodiment, the kit may include a 
recombinantly produced or chemically synthesized polypeptide antigen. The 
polypeptide antigen of the kit may also be attached to a solid support. 

In an alternative embodiment, the detecting means of the above-described kit 
includes a solid support to which the polypeptide antigen is attached. The kit can 
25 also include a non-attached reporter-labeled anti-human antibody. Binding of the 
antibody to the polypeptide antigen can be detected by binding of the reporter- 
labeled antibody. 

hi an additional embodiment, the invention includes a diagnostic kit for use 
in screening serum samples containing antigens of the polypeptide of the invention. 
30 The diagnostic kit includes a substantially isolated antibody specifically 

immunoreactive with polypeptide or polynucleotide antigens, and means for 
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detecting the binding of the polynucleotide or polypeptide antigen to the antibody. 
In one embodiment, the antibody is attached to a solid support, hi another 
embodiment, the antibody may be a monoclonal antibody. The detecting means of 
the kit can include a second, labeled monoclonal antibody. Alternatively, or in 
5 addition, the detecting means can include a labeled, competing antigen 

In one diagnostic configuration, the test serum sample is reacted with a solid 
phase reagent having a surface-bound antigen obtained by the methods of the present 
invention. After binding with specific antigen antibody to the reagent and removing 
unbound serum components by washing, the reagent is reacted with reporter-labeled 
1 0 anti-human antibody to bind reporter to the reagent in proportion to the amount of 
bound anti-antigen antibody on the solid support. Generally, the reagent is washed 
again to remove unbound labeled antibody, and the amount of reporter associated 
with the reagent is determined. The reporter can be an enzyme, for example, which 
is detected by incubating the solid phase in the presence of a suitable fluorometric, 
1 5 luminescent or calorimetric substrate, as is standard in the art. 

The solid surface reagent in the above assay is prepared by known techniques 
for attaching protein material to solid support material. Suitable solid support 
materials include, for example and without limitation, polymeric beads, dip sticks, 
96-well plate or filter material. 
20 The invention will be further described with reference to the following non- 

limiting examples. The teaching of all patents, patent applications and all other 
publications and websites cited herein are incorporated by reference in their entirety. 

EXEMPLIFICATION 

Invariably, MLL translocations result in the production of a chimeric protein 

25 where the NH 2 -tenninal portion of MLL is fused to the COOH-terminal portion of 
one of >20 fusion partners (Dimartino and Cleary, Br J Haematol 106:614-626 
(1999)). This has prompted models of leukemogenesis in which the MLLfusion 
protein may confer a gain-of-function or neomorphic properties, or alternatively 
represent a dominant negative that interferes with normal MLL function. Moreover, 

3 0 mice heterozygous for MLL (+/-) demonstrate developmental aberrations (Yu et al., 
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Nature 378:505-508 (1995); Hess et al., Blood 90:1799-1806 (1997)), suggesting the 
disruption of one allele by chromosomal translocation might also manifest as haplo- 
insufficiency in leukemic cells. 

MLL is a homeotic regulator which shares homology with Drosophila 
5 trithorax (trx) and positively regulates the maintenance of homeotic (Hox) gene 
expression during development (Yu et al., Nature 378:505-508 (1995)). MLL 
deficient mice indicate that MLL is required for proper segment identity in the 
axioskeletal system, and also regulates hematopoiesis (Hess et al., Blood 90:1799- 
1806 (1997)). As MLL normally regulates Hox gene expression, its role in 

1 0 leukemogenesis may include altered patterns of HOX gene expression. An 

expanding body of literature shows that HOX genes are important for appropriate 
hematopoietic development (Buske and Humphries, Int J I Hematol 71:301-308 
(2000)). Also, the t(7;ll) (pl5;pl5) translocation found in human acute myelogenous 
leukemia (AML) results in a fusion of HOXA9 to the nucleoporin NUP98 

15 (Nakamura et al., Nat Genet 12:154-158 (1996) and Borrow et al., Nat Genet 

12: 159-167 (1996)). Thus, HOX genes represent one set of transcriptional targets 
that warrants assessment in leukemias with MLL translocation. 

We hypothesized that MLL translocations maintain a gene expression 
program that results in a distinct form of leukemia. It was reasoned that RNA 

20 profiles might help resolve whether leukemias bearing an MLL translocation 
represent a truly biphenotypic leukemia of mixed identity, a conventional B-cell 
precursor ALL with expression of limited myeloid genes, or a less committed 
hematopoietic progenitor cell Moreover, comparing gene expression profiles of 
lymphoblastic leukemias with and without rearranged MLL is important because of 

25 their dramatically different response to standard ALL therapy, and because such 
analysis may identify new molecular targets for therapeutic approaches. The 
expression profiles reported here reveal that lymphoblastic leukemias bearing MLL 
translocations display a remarkably uniform and highly distinct pattern that clearly 
distinguishes them from conventional ALL or AML and warrants designation as a 

3 0 distinct disease, MLL. 
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Methods 

Patient samples. After informed consent was obtained, leukemia samples were 
obtained from peripheral blood or bone marrow from patients at diagnosis or 
relapse. When the samples were obtained from peripheral blood, the percentage of 
5 blasts was greater than 60% of the total white blood cells present. Fifteen of the 
samples with an MLL translocation and all of the conventional childhood ALL 
samples were obtained from patients treated on Dana Farber Cancer Institute 
protocols between 1980 and 2001. Three of the infant leukemia samples with MLL 
rearrangements were obtained from patients treated on the Interfant99 protocol, and 

10 the two adult patients with MLL rearrangements were obtained from patients treated 
at Princess Margaret Hospital in Toronto. Except for one of the conventional ALL 
samples and four of the MLL samples that were obtained at relapse, all samples were 
diagnostic specimens. The AML samples have been previously described (Golub et 
al., Science 286:531-537 (1999)), and were from both adults and children. Eight of 

15 the MLL rearranged samples contain t(4;l 1), one t(9; 1 1), three t(l 1;19), one t(3;l 1) 
and one t(l; 1 1). Six of the MLL rearrangements were detected by either FISH or 
Southern blot, and thus the translocation partner is unknown. The mononuclear cells 
were purified from red blood cells and neutrophils by ficoll-hypaque density 
centrifugation and either frozen in liquid nitrogen with 1 0% DMSO in fetal calf 

20 serum or put directly into Trizol (Life Sciences) for RNA purification. 

Assessment for the presence of MLL translocations. All patient samples were 
assessed by standard cytogenetics. All childhood ALL patient samples were screened 
for the presence of a TELAML1 translocation by RT -PCR as previously described, 
Loh, ML. et al., Blood 92:4792-4797 (1998). Any patient sample where 
25 cytogenetics failed and had no TEL- AML I translocation was further assessed by 
fluoresence in situ hybridization (FISH) using a probe that spans the 1 lq23 
breakpoint or by Southern blot (Silverman et al., Cancer 80:2285-2295 (1997); and 
Cuthbert et al., Genes Chromosomes Cancer 29:180-185 (2000)). AML samples 
were not assessed for chromosomal translocations. 
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RNA purification, labeling and hybridization. A total of 10-20 xlO 6 cells were 
used to prepare total RNA using the Trizol (Life Sciences) purification method. 
This generally yielded between 5 and 20 \ig of total RNA the quality of which was 
analyzed by gel electrophoresis. If the rRNA bands were intact, the RNA was 
5 determined to be of good quality and 5-15 fig was used for subsequent production of 
biotinylated cRNA as described previously (Golub et al., Science 286:531-537 
(1999)), and were from both adults and children. Briefly, first strand cDNA 
synthesis was generated using a T7-linlced oligo-dT primer, followed by second 
strand synthesis. An in vitro transcription reaction was done to generate the cRNA 

10 containing biotinylated UTP and CTP, which was subsequently chemically 

fragmented at 95°C for 35 minutes. Samples were excluded if less than 15 \xg of 
labeled RNA was produced. Labeled RNA was then hybridized to Affymetrix 
(Santa Clara, CA) U95A or U95A V2 oligonucleotide airays at 45°C for 16 hours. 
Arrays were washed and stained with streptavidin-phycoerytherin (SAPE 5 Molecular 

15 Probes). The signal was amplified using a biotinylated anti-streptavidin antibody 
(V ector Laboratories, Burlingame, California) at 3 Hg/ml. This was followed by a 
second staining with SAPE. Normal goat IgG was used as a blocking agent. The 
scans were performed on Affymetrix scanners and the expression values calculated 
using Affymetrix GENECHIP software. The chip image was then scanned visually 

20 for obvious differences between arrays. If there were obvious abnormalities present 
in the image, the sample was re-hybridized. The scans were then normalized based 
on a linear scaling method as described in the supplementary material. The raw 
expression data as obtained from Asymetrix's GeneChip s re-scaled to account for 
different chip intensities. Briefly, each column (sample) in the dataset was 

25 multiplied by 1/slope of a least squares linear fit of the sample versus the reference 
(the first sample in the dataset). This linear fit is done using only genes that have 
■Present' calls in both the sample being re-scaled and the reference. The sample 
chosen as reference was a typical one (i.e., one with the number of "P" calls closer to 
the average over all samples in the dataset). Samples were disregarded if the scaling 

30 factor was greater than 3 fold. 
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A threshold of 100 units was imposed before analysis because at those low 
values the data is noisy and not very reproducible. A ceiling of 16,000 units was also 
imposed due to saturation effects. After this preprocessing gene expression values 
were subjected to a variation filter which excluded genes showing minimal variation 
5 across the samples being analyzed. The variation filter tests for a fold-change and 
absolute variation over samples (comparing max/min and max-min with predefined 
values and excluding genes not obeying both conditions). The max/min filter was 5 
and the max-min 500 for all experiments. 

Data analysis. Identification of genes that are correlated with particular class 
1 0 distinctions was performed as previously described (Golub et al., Science 286:53 1- 
537 (1999)). The signal-to-noise statistic (Uo-uO/H+Oj) was used where u and o 
represent the median and standard deviation of expression, respectively for each 
class. One hundred permutations of the samples were performed to determine if the 
correlations were greater than would be expected by chance with a 99% confidence. 
1 5 The class predictor was performed using a cross-validation approach and the 

K- Nearest Neighbors (K-nn) algorithm as follows. The k-nearest neighbors (k-NN) 
algorithm predicts the class of a new sample by calculating the Euclidean distance of 
the new sample to samples in a training set whose location has been identified in 
expression space. The predicted class of the new sample is then determined by 
20 identifying the class to which the majority of the k-nearest neighbors belong. The 
genes used to determine the location in expression space of each sample were 
identified by determining which genes best correlated with the class distinction as 
described above using the signal to noise statistic. For all experiments k=5. The 
prediction results shown in FIG. 6 were done using a cross validation approach 
25 where 1 of the 57 samples was withheld, the genes that best correlated with the 
ALIVMLL/AML distinction were determined, and those genes were used to build 
the k-nn algorithm. This was done for anywhere from 1-250 genes and me error rate 
(number of failures/57) is graphed vs. the number of genes. For the test set samples, 
the model was built with the 57 train set samples and then the class membership was 
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determined for each "test sample" by determining the samples "neighbors" in gene 
expression space as described above. 

Principal component analysis was performed using S-plus statistical software 
and the default settings. A commonly used technique for data reduction and 
5 visualization is principal component analysis (PCA). In this type of analysis, the 
linear combinations of variables are identified as the principal components that 
explain the variability in the dataset. To reduce the dimensionality of the data, the 
top 2 or 3 components can be graphed. In our case, the top 3 principal were thus 
used to project the samples in 3-dimensional space based on the gene expression 

10 profile. We first performed the analysis using the 8700 genes that passed the filtering 
described above. PCA was then performed using the top 500 genes that correlated 
with the AMU ALL class distinction. ALL, MLL, AML samples were then projected 
in that 500 gene space. These analysis were performed using S-plus statistical 
software using the default settings and co variance, followed by a three dimensional 

15 scatter plot of the coordinates of the 3 principal components for each sample. 

Singular value decomposition was used to derive the eigen values of the co variance 
matrix for the 8700 gene analysis. The S-Plus function used was "princompO". The 
coordinates of the three principal components for each sample were then used to 
project the samples in three dimensions. 

20 Results 

MLL is distinct from ALL 

To further define the biological characteristics specified by MLL 
translocations, gene expression profiles of leukemic cells from patients diagnosed 
with acute lymphoblastic leukemia bearing an MLL translocation were compared 
25 with conventional ALL which lack this translocation. Initially, samples from 20 

patients with childhood ALL (denoted ALL), and 17 patients with MLL translocation 
(referred to as MLL) were collected. Patient details are presented in Table 1 (MLL) 
and Table 2 (ALL). 
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Table 1 . MLL Patient Data 



15 



Patient 


Chromosome 


Sample 


Age at 
1 Diagnosis 


1 Specimen 


Figure/Column 
(MLL) 


MLL_1 


t(4;ll) 


BM 


1 month 


Diagnostic 


Fig. 1&5 
Column 1 


MLL_2 


So.Blot+# 


BM 


3 months 


Relapse 


Fig. 1&5 
Column 2 


MLL_3 


t(4;ll) 


PB 


8 months 


Diagnostic 


Fig. 1&5 
Column 3 


MLL_4 


FISH +* 


PB 


2 months 


Relapse 


Fig. 1&5 
Column 4 


MLL_5 


FISH + 


BM 


2 months 


Diagnostic 


Fig. 1&5 
Column 5 


MLL_6 


FISH + 


PB 


18 

months 


Diagnostic 


Fig. 1&5 
Column 6 


MLL_7 


t(4;ll) 


BM 


8 months 


Relapse 


Fig. 1&5 
Column 7 


MLL_8 


t(4;ll) 


PB 1 


5 months 


Diagnostic 


Fig. 1&5 
Column 8 


MLL_9 


So.Blot + 


PB 1 


7 months 


Diagnostic 


Fig. 1&5 
Column 9 


MLL_10 


t(l;ll) 


PB I 


1 month 


Diagnostic 


Fig. 1&5 
Column 10 


MLL_11 


t(3;ll) 
(ql3;q23) 


PB I 


1 day 


Diagnostic 


Fig. 1&5 
Column 11 


MLL_12 


t(H;i9) 


BM 


3 months 


Relapse 


Fig. 1&5 
Column 12 


MLL_13 


t(4;ll) 


PB 


1 month 


Diagnostic 


Fig. 1&5 
Column 13 



\ 
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MLL_14 


t(ll;19) 


BM 


3 months 


Diagnostic 


Fig. 1&5 
Column 14 


MLLJ5 


t(H;l9) 


PB 


7 months 


Diagnostic 


Fig. 1&5 
Column 15 


MLL_16 


t(4;ll) 


PB 


>21 years 


Diagnostic 


Fig. 1&5 
Column 16 


MLL_17 


t(4;ll) 


PB 


>21 years 


Diagnostic 


Fig. 1&5 
Column 17 


MLLJ8 


FISH + 


PB 


10 

months 


Diagnostic 


Fig. 5 Column 
18 


MLL_19 


t(9;ll) 


PB 


4 years 


Diagnostic 


Fig. 5 Column 
19 


MLL_20 


t(4;ll) 


PB 


6 years 


Diagnostic 


Fig. 5 Column 
20 



PB= peripheral blood 
BM= bone marrow 
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Table 2. ALL Patient Data 



Patient 


Tel/AMLl 


Chromosomes 


Sample 


Age at 
Diagnosis 


Specimen 


Figure/ 
Cohzmn 
(ALL) 


ALL_1 


Pos. 


46xy(+6) 


BM 


6y 


Diagnostic 


Fig. 1&5 

Column 

1 


ALL_2 


Pos. 


Diploid 


BM 


5y 


Diagnostic 


Fig. 1&5 

Column 

2 


ALL_3 


Pos. 


No data 


BM 


4y 


Diagnostic 


Fig. 1&5 

Column 

3 


ALL_4 


Pos. 


Hyper- 

diploid, 

del7q 


PB 


12y 


Diagnostic 


Fig. 1&5 

Column 

4 


ALL_5 


Pos. 


Hyper- 

diploid, 

de!7q 


BM 


12y 


Relapse 


Fig. 1&5 

Column 

5 


ALL_6 


Pos. 


Add 1 P 36, 
del 12p 


BM 


4y 


Diagnostic 


Fig. 1&5 

Column 

6 


ALL_7 


Pos. 


Diploid 


BM 


2y 


Diagnostic 


Fig. 1&5 

Column 

7 


ALL_8 


Pos. 


No Data 


BM 


3y 


Diagnostic 


Fig. 1&5 

Column 

8 
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ALL_9 


Neg. 


Diploid 


BM 


9y 


Diagnostic 


Fig. 1&5 . 

Column 

9 


ALL_10 


Neg. 


Del 3p, 
Add 12p, 
Add 17q 


BM 


5y 


Diagnostic 


Fig. 1&5 

Column 

10 


ALL_11 


Neg. 


del 12pl3 


BM 


20m 


Relapse 


Fig. 1&5 

Column 

11 


ALL_12 


Neg. 


Hyperdiploid 


BM 


13m 


Diagnostic 


Fig. 1&5 

Colunm 

12 


ALL_13 


Neg. | 


Hyperdiploid 


PB 


4y 


Relapse 


Fig. 1&5 

Column 

13 


ALL_14 


Neg. 


Diploid 


BM 


20m 


Diagnostic 


Fig. 1&5 

Column 

14 


ALL_15 


Neg. 


Diploid 


BM 


lOy 


Diagnostic 


Fig. 1&5 

Column 

15 


ALL_16 


Neg. 


Diploid 


BM 


4y 


Diagnostic 


Fig. 1&5 
Column , 
16 


ALL_17 


Neg. 


Del 9p21 


PB 


lly 


Diagnostic 


Fig. 1&5 

Column 

17 
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ALL_18 


Neg. 


Failed 
(MIX FISH 
Neg.) 


PB 


15y 


Diagnostic 


Fig. 1&5 

Column 

18 


ALL_19 


Neg. 


Hyperdiploid 


BM 


23m 


Diagnostic 


Fig. 1&5 

Column 

19 


ALL_20 


Neg. 


Diploid 


BM 


9y 


Diagnostic 


Fig. 1&5 

Column 

20 


ALL-21 


Neg. 


Failed 
(MIX FISH 
Neg.) 


PB 


12m 


Diagnostic 


Fig. 1&5 

Column 

21 


ALL_22 


Neg. 


t(l;19) 


PB 


7y 


Diagnostic 


Fig. 5 

Column 

22 


ALL_23 


Pos. 


Diploid 


PB 


3y 


Diagnostic 


Fig. 5 

Column 

23 


ALL_24 


Pos. 


No Data 


BM 


8y 


Diagnostic 


Fig. 5 

Column 

24 



Pos. = positive for TelAMLl translocation 
Neg = negative for TelAMLl translocation 



10 PB = peripheral blood 
BM = bone marrow 
m = months 
y = years 

First, it was determined if there were genes among the 12 5 600 tested whose 
1 5 expression pattern correlated with the presence of an MLL translocation. The genes 
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were sorted by their degree of correlation with the MLL/ALL distinction (FIGs. 1 A 
and IB), and permutation testing was used to assess the statistical significance of the 
observed differences in gene expression (Golub et al., Science 286:531-537 (1999)). 
For the 37 samples tested, approximately 1000 genes proved underexpressed in 
5 MLL as compared to conventional ALL while approximately 200 genes were 
relatively highly expressed. FIG. 1 A shows the top 50 genes that are relatively 
underexpressed in MLL, and FIG. IB shows the bottom 50 genes that are relatively 
overexpressed in MLL. Genes, and their GenBank Accession Numbers, are labeled 
at the right. The top 200 genes that make the ALL/MLL distinction and their 
1 0 GenBank Accession Numbers can be found in Table 3 (top 100 genes that are 
underexpressed in MLL compared to ALL) and Table 4 (top 100 genes that are 
overexpressed in MLL compared to ALL). 



WO 03/008552 



-57- 



PCT/US02/22823 



Table 3. Genes Underexpressed in MLL Compared to ALL 



GenBankNo. Name GenBank No. Name 





J03779 


CD10 


AA808961 


cDNAnwl6h03.sl 




AL050105 


DKFZp586H0519 


AB018303 


KIAA0760 






UJJZ4 


X74837 


HUMM9 




Y12735 


Dyrk3 


AL022723 


Chromosome 6 










sequence 




AB020674 


KIAA0867 


M74719 


E2-2 




D26070 


ITPR3 


M63838 


IFI16 


10 


Ml 1722 


TdT 


U81607 


Gravin 


- 


M61877 


a-spectrin 


U96113 


Nedd-4 like ubiquitin 




X59350 


CD22 


D13639 


ligase 
KIAK0002 




W25798 


cDNA 13fl2 


X53586 


Integrin alpha 6 


15 


AL049279 


DKFZp564I083 


U01062 


ITPR3 




AF032885 


FKHR 


M21535 


ERG 11 




L46922 


FHTT 


D88827 


Zinc finger protein 










FPM315 




S67427 


myosin 


W26406 


cDNA29b7 


20 


AL079277 


Unknown cDNA 


U15642 


E2F-5 




M96803 


b-spectrin 


D43949 


KIAA0082 




X83441 


DNALigaselV 


J011001 


TM7XN1 




XI 5357 


ANP-receptor 


M63928 


CD27 




M55284 


Protein Kinase C-L 


AB028961 


KIAA1038 


25 


AI146846 


cDNAqb92h04 


X55740 


5' Nucleotidase 




AB023176 


KIAA0959 


L05186 


Focal adhesion kinase 




AF002999 


TRF2 


AF070588 


cDNA 24554 




D26070 


ITPR3 


U48705 


Tyrosine Kinase DDR 




Y11312 


PB-Kinase 


AF070614 


cDNA 24732 


30 


U48959 


MLCK 


U23850 


ITPR3 
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Table 3 (continued) 



GenBank No. Name GenBank No. Name 





J05243 


a-spectrin 


AF054180 


Hematopoietic zinc 
finger protein 


5 


Y14768 


Cosmid TN62 


AL049471 


DKFZp586N012 




U01062 


ITPR3 


AF084481 


WFS1 




Z49194 


OBF-1 


U90547 


Human Ro 




V59423 


Smadl 


U15085 


HLA-DMB 




U29175 


Snf2-b 


AJ001381 


cDNA for an allele of 


10 








myosin 




M81141 


HLA-DQ-b 


AL049933 


DKFZp564K1216 




D87437 


KIAA0250 


AI198311 


cDNAqi61fll.xl 




Y00264 


Amyloid A4 precursor 


AL050060 


DKFZp566H073 




U59912 


Smadl 


X06318 


PKC-bl 


15 


AJO07583 


acetylglucosaminyltransferase U43885 


Grb2-associated 










binder-1 




L75847 


ZNF-45 


M31523 


E2A transcription factor 




D17530 


Dreberin E 


L10373 


clone CCG-B7 




D86967 


KIAA0212 


Y11306 


TCF-4 


20 


J03600 


Lipoxygenase 


X05323 


MRC OX-2 gene 




D42055 


KIAA0093 


X78932 


Zinc finger protein 
HZF9 




AL021154 


Chromosome 1 PAC 


W26633 


cDNA 34bl 




AI761647 


cDNAwg66h09 


AF052131 


clone 23930 


25 


AF054815 


VAMP5 


AL050260 


DKFZp564E1082 




N36926 


cDNA YY38E04 


AL080218 


DKFZp586N1323 




U96113 


Nedd-4-like ubiquitin ligase A1561 196 


cDNAtq27a01.xl 




AB019527 


LDOC1 


W26023 


cDNA18c3 



\ 
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Table 3 (continued) 





GenBank No. Name 


GenBank No. Name 




M34641 


FGF Receptor-1 


U68186 


Extracellular matrix I 




L29376 


MHC class I mRNA 


X62744 


RING6 


5 


M60028 


HLA-DQ-b 


X78926 


Zinc Finger HZF3 




Table 4. Genes Overexpressed in MLL Compared to ALL 






GenBank No. Name 


GenBank No. Name 




AI535946 


Lectin HL14 


U67516 


MAPKKK5 




M14087 


Lectin HL14 


- AF072099 


Immunoglobulin-like 


10 








transcript 3 




AI201310 


cDNAqf71bll 


X96753 


chondroitin sulfate 










proteoglycan NG2 




M80899 


AHNAK 


X52075 


CD43 




AJ001687 


NKG2D 


U38545 


Phospholipase I 


15 


U66838 


CyclinAl 


D00017 


Annexinll 




M59040 


CD44 


U02687 


FLT-3 




M95929 


Phoxl 


AA570193 


cDNAnf38cll.sl 




D25217 


KIAA0027 


U21551 


ECA39 




AI597616 


cDNAtnl5f08 


X55988 


eosinophil derived 


20 








neurotoxin 




W72186 


cDNAzd69blO 


D78177 


quinolinate 










phosphoribosyl 










transferase 




U41813 


HOXA9 


Z48579 


disintegrin-metalloprote 


25 








ase 




L05424 


CD44 


AF039656 


NAP-22 




L05424 


CD44 


Y00638 


CD45 



i 
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Table 4 (continued) 



GenBank No. Name GenBank No. Name 





AC004080 


Chromosome 7 PAC 


J03910 


Metallothionein-IG 




X05908 


Annexin I 


AF030339 


VESPR 


5 


U78027 


chromosome X BAC 


M28713 


NADH-cytochrome B5 










reductase 




Y00062 


CD45 


AB023209 


KIAA0992 




AF098641 


CD44 


L40377 


CAP2 




Y000638 


CD45 


D86181 


galactocerebrosidase 


10 


AF004230 


monocyte elastase inhibitor 


M83215 


AML1 




W60864 


cDNAzd27g05 


X61118 


LM02 




X55989 


Eosinophil CRP 


U01147 


ABR 




M93056 


Monocyte MIR-7 


M96995 


GRB2 




AF027208 


AC 133 


AF040704 


putative tumor 


15 








suppressor (101F6) 




Z83844 


Chromosome 22 sequence 


U57971 


calcium ATPase 




AL050396 


DKFZp586K1720 


AB028948 


KIAA1025 




AA978353 


cDNAoq40b07 


U11791 


cyclin H 




D21261 


KIAA0120 


AF022991 


Rigui 


20 


L08177 


EBV induced EBI2 


LI 1669 


Tetracycline-like 










transporter 




L19182 


MAC25 


U39064 


MAPKK6 




D28364 


Annexin II 


AF054176 


Angiotensin 




D15057 


DAD-1 


U87947 


HNMP-1 


25 


AF020044 


C-type lectin precursor 


L19872 


AH-receptor 




AI138834 


cDNAqe04b02 


M36035 


benzodiazepine receptor 




X73882 


E-MAP-115 


U93305 


Chromosome X pi 1 










sequence 




AF026816 


Putative Oncogene 


AF009615 


ADAM10 
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Table 4 (continued) 



GenBankNo. Name GenBank No. Name 



5 



X17042 


Proteoglycan I 


X52541 


EGR1 


M20867 


glutamate dehydrogenase 


AF044253 


potassium channel 






• 


beta-2 


U6006O 


FEZ1 


M26683 


Interferon gamma 








inducible mRNA 


AF025529 


LIR-6 


M31166 


TSG-14 


AB007888 


KIAA0428 


R93527 


cDNAyq35fl0.rl 


M13485 


Metallothionein I-B 


X15998 


chondroiton sulfate 








proteoglycan 


M26679 


HOXA5 


X55990 


eosinophil cationic 








protein 


AL050267 


DKFZp564A032 


U73960 


ARF-like protein 4 


R92331 


cDNA03h03 


M13452 


Lamin A 


AL050162 


DKFZp586B2022 


AI560890 


cDNA tq41d05.xl 


AI017574 


cDNAou23fl0 


AB024057 


vascular rab-gap 


M62896 


Annexinll 


M97815 


CRABP-E 


AL050374 


DKFZp586cl619 


M60614 


WIT-1 



20 As shown in FIGs. 1A and IB and Tables 1 and 2, MLL shows a 

dramatically different gene expression profile from AT J,, 

MLL shows multi-lineage gene expression 

Inspection of the genes differentially expressed between MLL and ALL was 
instructive (FIGs. 1A and IB and Tables 1 and 2). Many underexpressed genes in 
25 MLL have a function in early B cell development. These include genes expressed in 
early B-cells (CD 10, CD24, CD22, Tdl) (Hardy and Hayakawa, Annu Rev 
Immunol, 19:595-621 (2001); LeBien, Blood 96:9-23 (2000)), genes required for 
appropriate B-cell development (E2A, E2-2, PI3-Kinase, Octamer Binding Factor-1 , 
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andDNA ligaselV) (Murre, Cold Spring Barb Symp Quant Bio 164:39-44 (1999); 
Fruman et al., Science 283:393-397 (1999); Schubart et al., Nat Immunol 2:69-74 
(2001); and Frank et al., Nature 396:173-177 (1998)), and genes found to be 
correlated with B-precursor ALL in an AML/ALL comparison (Snj2-(3) (Golub et al., 
5 Science 286:531-537 (1999)). The relative underexpression of the forkhead 
(FKHR), SMAD1 and TCF-4 transcription factors suggests they may also be 
involved in later stages of B-cell differentiation or leukemogenesis. Relatively 
overexpressed genes in MLL include the adhesion molecules HL14, Annexin I 
Annexin II, CD44, and CD43. Multiple genes that are expressed in hematopoietic 

10 lineages other than lymphocytes are also highly expressed in MLL. These include 
genes expressed in progenitors (AC133, FLT3, LM02) (Yin et al., Blood 90:5002- 
5012 (1997); Rosnet et al., Blood 82:1 1 10-1 119 (1993); and Dong et al., Br J 
Haematol 93:280-286 (1996)), myeloid specific genes (Cyclin Al, monocyte elastase 
inhibitor, macrophage capping protein, eosinophil-CRP), (Yang et al., Blood 

15 93:2067-2074 (1999); Remold-O'Donnell et al., Proc NatlAcad Sci USA 89:5635- 
5639 (1992); and Rosenberg et al., J Exp Med. 170:163-176 (1989)), and at least one 
natural killer cell specific gene (NKG2D) (Ho et al, Proc Natl Acad Sci USA, 
95:6320-6325 (1998)) (FIGs. 1A, IB and Tables 1 and 2). Overexpression of 
HOXA9 and Proteoglycan I'm MLL is of particular interest as these genes were 

20 previously reported to be highly expressed in AML (Golub et al., Science 286:531- 
537 (1999)), and overexpression oiHOXA9 has been associated with a poor 
prognosis (Golub et al., Science 286:531-537 (1999)). 

MLL is arrested at an early stage of hematopoietic development 

Since lymphoblasts with MLL rearrangement express many myeloid specific 
25 genes, a detailed assessment of the expression of lymphoid genes was performed. 
Genes known to mark early B-lymphoid commitment such as IgP and CD19 are 
expressed in MLL albeit at lower levels than in ALL (FIGs. 2C and 2B). CD10 
(CALLA) is not expressed in MLL (FIG. 2A), whereas the IL-7 receptor is 
expressed at similar levels in ALL and MLL. 
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A number of genes have been shown to vary their expression level as murine 
hematopoietic cells transition from stem cell, to common lymphoid progenitor, to 
pro-B and then pre-B cells. IgP, CD24, CD44 and CD43 represent early steps of 
lymphoid development (Hardy and Hayakawa, Annu Rev Immunol 19:595-621 
5 (2001) and Kondo et al., Cell 91 :661-672 (1 997)). Igp and CD24 expression 

increases with maturation while CD44 and CD43 levels decrease (Kondo et al., Cell 
91 :661-672 (1997)). The MLL samples express relatively low levels of CD24 and 
Igp but high levels of both CD44 and CD43 (FIGs. 2F and 2E). In total these data 
suggest that MLL represents a maturational arrest at an early lymphoid progenitor 
10 stage of development. 

Selected HOX genes are highly expressed in MLL versus ALL 

Multiple members of the class I Hox genes are known to be regulated by Mil 
(Yu et al., Nature 378:505-508 (1995)) prompting a detailed comparison of the 
patterns of HOXgen& expression between ALL and MLL. Several of the 20 class I 
HOX genes present on the microarrays demonstrated significant and consistent 
differences in expression, HOXA9 and HOXA5 were not expressed in conventional 
ALL, but were expressed, often at high levels, in most MLL samples (FIGs. 3A-3D). 
Similarly, HOXA4 was typically expressed in MLL, but rarely in conventional ALL 
(FIG. 3C). HOXC6 showed mildly elevated levels of expression in MLL 
(Supplemental Information at http://research.clfci.harvard.edu/korsmeyer/MLL.htm). 
However, the tfOATpatterns displayed selectivity as other genes such as HOXA7 
showed no obvious difference in their expression pattern (FIG. 3D). MEIS1, a 
cofactor for HOX proteins, which can accelerate HoxA9 dependent leukemia 
(Nakamura et al., Nat Genet 12:149-1531 (1996)), was also significantly 
overexpressed in MLL as previously reported for the t(4;l 1) containing subset 
(Rozovskaia et al., Oncogene 20:874-878 (2001)). 

MLL is distinct from both AML and ALL 

MLL is characterized by the expression of myeloid specific genes, which 
raised the possibility that MLL is more closely related to acute myelogenous 



15 



20 



25 
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leukemia (AML). To determine if this is the case, or if MLL is separable as a 
distinct type of leukemia, a principal component analysis (PCA) was performed 
using the gene expression profiles of MLL, ALL and AML specimens. This 
clustering algorithm reduces complex multidimensional data to a few specified 
5 dimensions so that it can be visualized effectively (V enables and Ripley, Modern 
Applied Statistics with S-Plus, Springer Verlag, New York (1994)). First, the 
analysis was performed in an unsupervised manner using the 8700 genes that 
showed some variability in expression level. As expected, the ALL and AML 
samples displayed substantial separation (FIG. 4A). Of note, the MLL samples 

10 proved largely separate from either AML or ALL (FIG. 4A). In order to detennine if 
this separation could be attributed to a difference in hematopoietic identity, a similar 
analysis was performed using the 500 genes whose expression best distinguished the 
separation of AML versus ALL. When projected into this 500-gene space using - 
PCA the MLL samples principally fall between the AML and ALL samples (FIG. 

15 4B). 

Since the above clustering analyses supported three distinct entities of ALL, 
AML and MLL, it was queried if selected genes could be identified which 
distinguished each type of leukemia from the other two (FIG. 5). Conventional ALL 
expressed high levels of the following genes compared to MLL and AML: CD 10, 

20 CD24, DYRK f TdT, FKHR, DNA ligase IV, KIAA0867, CD22, OBF-1, B- spectrin, 
DKFZp5641083, Snf-2B, MLCK, VAMPS, and cDNA wg66h09) and these genes 
were underexpressed in MLL and ALL. AML samples expressed high levels of the 
following genes compared to ALL and MLL: adipsin, cathepsinD, CD13, M6 
antigen, gap junction protein, a-endosulfine, NF-2 transcription factor DP-2, 

25 DRAP1, cDNA 20cl, phosphodiesterase 3B, cosmid 19pl3.2, chromosome 19 clone, 
chromosome 22qll clone, and CRYAA, and these genes were underexpressed in 
ALL and MLL. MLL samples expressed high levels of the following genes 
compared to ALL and AML: AC 133, LM02, FLT3, KIAA0428, NKG2D, AD AMI 0, 
KIAA1025, Lectin HL14, cDNA ag36c04, cyclin Al, ADAM10, putative oncogene, 

30 DKFZp585o01, KIAA0920, and LM02, and these genes were underexpressed in 
ALL and AML. The GenBank Accession Numbers for each of these genes is shown 
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in FIG. 5. Permutation analysis indicated that approximately 200 genes were 
significantly overexpressed in MIX as compared to the other two leukemia 
categories. In combination, the PCA and gene expression comparisons (FIGs. 4A, 
4B and 5) indicate that MLL is a separable, distinct disease based on gene 
5 expression profile. These data also show that ALL and AML are separable distinct 
diseases. The genes shown in FIG. 5, particularly the genes that are over expressed 
in each disease type, can be used in gene expression profile studies to diagnose 
MLL, ALL, or AML. These genes, including underexpressed and overexpressed 
genes for each disease type, can also be used as target for identifying and/or 
10 detecting compounds that alter expression and/or activity of these genes or their 
gene products, for therapeutic methods, and for monitoring efficacy of treatment, as 
described herein. 

Gene expression profiles correctly classify ALL, MLL and AML 

A more stringent assessment of the power of the aforementioned difference 
15 in gene expression profiles would be their capacity to assign individual samples as 
MLL, ALL, or AML. The detection of MLL translocations in leukemia samples is 
currently most often performed by either cytogenetic analysis or by fluorescence in 
situ hybridization (FISH) which can technically fail or may be unavailable. Thus, 
other approaches to correctly assign individual cases to meaningful subsets of 

20 leukemia would be useful. To test this possibility a three-class predictor was 
developed based on a k- nearest neighbors algorithm (Dasarathy (ed), IEEE 
Computer Society Press, Los Altos, CA, December 1991. ISBN: 0818689307). 
This algorithm assigns a test sample to a class by identifying the k nearest samples in 
the training set and choosing the most common class among these k nearest 

25 neighbors. For this purpose, distances were defined by a euclidean metric based on 
the expression levels of a specified number of genes. 

The accuracy of this method was assessed using a cross validation approach. 
When one of the 57 samples is removed, the genes that most closely correlate with 
the ALL/MLL/AML class distinction are identified, and the expression of these 

30 genes used to determine the class of the withheld sample. The model assigned the 



WO 03/008552 



-66- 



PCT/US02/22823 



withheld sample to the appropriate class with 95% accuracy. Moreover this accuracy 
was maintained as we extended from 40 to 250 genes to build the predictor (FIG. 6), 
as further testimony to the strong distinction among these leukemia categories. 

To assess if the unique signature of gene profiles in MLL samples could be 
5 attributed to their occurrence in infants, the above model was tested using 1 0 

independent leukemia samples. The test set consisted of 3 childhood (> 12 months) 
conventional ALLs, 2 lymphoblastic leukemias of childhood carrying 
cytogenetically verified MLL translocations, 2 infant (<1 2 months) leukemias in 
which cytogenetic analysis did not detect an MLL translocation and 3 AML samples. 

1 0 Utilizing the 1 00 genes that best correlated with the three-class distinction, nine of 
ten samples were correctly classified as MLL, ALL or AML. The one apparent error 
was an infant reported to be negative for an MLL rearrangement by cytogenetics, yet 
consistently predicted to have a rearrangement based on gene expression profile. 
This prompted further analysis by FISH, which confirmed that this infant leukemia 

15 did indeed possess an MLL translocation and that the prospective assignment by 
expression profiling was correct. Taken together these data show that the unique 
gene expression profile characteristic of MLL cannot be attributed merely to the fact, 
that most samples are from infants. 

Discussion 

20 Gene expression profiles of lymphoblastic leukemias which possess an MLL 

translocation are remarkably consistent and differ significantly from those of other 
leukemias. Consequently, it is appropriate that they be considered a distinct disease 
entitled MLL for "Mixed Lineage Leukemia." This is supported by their 
comparison to conventional B cell precursor ALL that lacks MLL rearrangement, 

25 where - 1 000 genes proved underexpressed and -200 overexpressed in the MLL 
rearranged group. Moreover, evaluation of the expression profiles using principal 
component analysis indicated that MLL was clearly separable from conventional 
ALL and also AML. The expression differences are so robust that -95% of 
leukemic samples were correctly classified as MLL, ALL or AML. As testimony to 

30 the extent of divergence of MLL, it remained separable from ALL and AML when 
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250 genes were used to build the class predictor. This provides strong evidence that 
a specific chromosomal translocation results in a distinct type of lymphoblastic 
leukemia, rather than a model in which all translocations merely provide 
transformation events that subsequently converge upon a common pathway of 
5 leukemogenesis. In addition, the data indicate that MIX is arrested at an earlier 
stage of differentiation and/or has a different cell of origin than ALL. Select Hex 
genes are overexposed in MLL-dependent leukemia, as compared to normal B-cell 
progenitors and other ALL. FLT3 expression best distinguishes MLL from ALL. 
Gene expression patterns of MLL provide insight into the proposed models 
10 for its cellular origin. A summary of expression profiles shows that MLL expresses 
some lymphocyte specific and myeloid specific genes, but at lower levels than either 
conventional ALL or AML, respectively. Based on murine studies that have defined 
gene expression patterns during lymphocyte commitment (Hardy and Hayakawa, - 
Annu Rev Immunol 19:595-621 (2001); andKondo et al., Cell 91:661-672 (1997)), 
15 the low-level expression of CD24 and Ig P , along with high expression of CD43 and 
CD44 suggests that MLL is arrested at an earlier stage of development than 
conventional ALL. Furthermore, the expression of genes typically found in 
progenitor cells suggests MLL represents an early hematopoietic progenitor. This is 
consistent with studies that have shown multi-lineage gene expression in 
20 hematopoietic progenitors prior to full lineage commitment (Hu et al., Genes Dev 
1 1 :774-785 (1997)). Of particular interest is the possibility that MLL may represent 
the expansion of a bipotential B-macrophage progenitor (Montecino-Rodriguez et 
al., Nat Immunol 2:83-88 (2001); and Cumano et al., Nature 356:612-615 (1992)). 
Early B-cells can be induced to differentiate into myelomonocytic cells under certain 
25 conditions (Nutt et al., Nature 401:556-562 (1999)), and derivation of macrophages 
from leukemia cell lines has been well documented (Borrello and Phipps, Immunol 
Today 17:471-475 (1996)). An attractive model would hold that the MIL-fusion 
protein drives the < ^differentiation" of an early lymphocyte progenitor. The 
expression of many myeloid and monocyte/macrophage specific genes is consistent 
30 with MLL reflecting a very early B cell progenitor that has initiated trans- 
differentiation. The multiple HOXg^s that are selectively expressed in MLL are 
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attractive candidates for direct targets of the MLL- fusion proteins. Mil gene ablated 
mice have indicated that select members of the clustered Hox genes require MLL for 
their expression. As overexpression of HoxA9 has also been shown to induce AML 
in mouse models (Nakamura et al., Nat Genet 12:149-1531 (1996)), and its 
5 expression is controlled by levels of Mil (Hanson et al., Proc Natl Acad Sci USA 
96:14372-14377 (1999)); misexpression of HOXA9 may be an important component 
of MLL-translocation driven leukemogenesis. The findings here prompt further 
studies to detennine if MLL-fusion proteins directly activate HOX genes, and thus 
lead to defects in hematopoietic differentiation. 
1 0 This is the first demonstration in which whole genome profiling reveals that 

a chromosomal translocation can specify a unique gene expression program. This 
separates MLL as a distinct disease, which is of both pathogenic and therapeutic 
importance. Lymphoblastic leukemias with MLL translocations are recognized as 
having a poor prognosis, as standard ALL therapies have been relatively ineffective. 
1 5 The unique identity of MLL noted here provides insight into the poor response. As 
MLL is a distinct disease, new therapeutic approaches are needed. Of note, pilot 
studies have shown that addition of the drug cytarabine, an important agent in 
myeloid leukemia treatment, may improve the outcome for MLL patients (Ludwig et 
al., Blood 92:1898-1909 (1998); Silverman et al., Cancer 80:2285-2295 (1997); and 
20 Pieters et al., Leukemia 12:1 344- 1 348 (1998)). However, it is the translocation- 
specific therapies which have recently proven attractive for their efficacy and lack of 
toxicity. Other leukemias in which a translocation specifies a distinct disease are 
chronic myelogenous leukemia (CML) with the BCR-ABL fusion and acute 
promyelocytic leukemia (APL) with the PML-RARct fusion. The tailored 
25 development of the tyrosine kinase inhibitor STI571 and its treatment of CML and 
the use of all trans retinoic acid (ATRA) in APL has substantially improved the 
outcome in those diseases (Tallman et al., N Engl J Med 337:1021-1028 (1997) and 
Druker et al., N Engl J Med 344:1031-1037 (2001)). While pharmacologic 
approaches to the complex regulatory capacity of MLL may prove challenging, the 
30 distinct gene expression signature defined here for MLL may provide unanticipated 
molecular targets. Of special note, FLT3 is the most differentially expressed gene 
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that distinguishes MLL from ALL and AML (FIG. 5). Aberrations of FLT3, 
especially duplication of its juxtamembrane domain, have been noted in some cases 
of AML and may be leukemogenic (Nakao et al., Leukemia 10:1911-1918 (1996); 
Zhao et al., Leukemia 14:374-378 (2000); and Tse et al., Leukemia 14:1766-1776 
5 (2000)). As a tyrosine kinase receptor, FLT3 represents an attractive target for 
rational drug development. 

Additional information regarding the methods used to carry out the above 
described studies, patient samples, and the differentially expressed genes identified 
though these studies can be found at 
10 htm://research.dfci.harv ard.edu/korsmever/MLL.htrn. and htto://www- 

genome.wi.nut.edu/MPR, the teachings of which are incorporated herein by 
reference in their entirety. 

While this invention has been particularly shown and described with 
references to preferred embodiments thereof, it will be understood by those skilled 
15 in the art that various changes in form and details may be made therein without 
departing from the scope of the invention encompassed by the appended claims. 
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CLAIMS 

What is claimed is: 

1 . A method of diagnosing mixed lineage leukemia, acute lymphoblastic 

leukemia or acute myelogenous leukemia, comprising determining a gene 
5 expression profile of a gene expression product from at least one informative 

gene from one or more cells, wherein the cells are selected from the group 
consisting of mononuclear blood cells and bone marrow cells, and wherein 
the gene expression profile is correlated with mixed lineage leukemia, acute 
lymphoblastic leukemia or acute myelogenous leukemia. 

10 2. The method of Claim 1 , wherein the gene expression product is RNA. 

3. The method of Claim 2, wherein the gene expression profile is determined 
utilizing specific hybridization probes. 

4. The method of Claim 2, wherein the gene expression profile is determined 
utilizing oligonucleotide microarrays. 

15 5. The method of Claim 1 , wherein the gene expression product is a peptide. 

6. The method of Claim 5, wherein the gene expression profile is determined 
utilizing antibodies. 

7. The method of Claim 1, wherein the informative gene is selected from the 
group consisting of the genes in FIGs. 1 A, IB, 2A-2F, 3 A-3D, and 5, and 

20 Tables 1 and 2. 

8. A method of diagnosing mixed lineage leukemia, acute lymphoblastic 
leukemia or acute myelogenous leukemia, comprising: 



WO 03/008552 



-71- 



PCT/US02/22823 



a) determining a gene expression profile of mRNA from at least one 
informative gene, wherein the mRNA is isolated from one or more 
cells of an individual and selected from the group consisting of 
mononuclear blood cells and bone marrow cells; and 
5 b) comparing the gene expression profile of a) to a gene expression 

profile of a control sample selected from the group consisting of a 
mixed lineage leukemia sample, an acute lymphoblastic leukemia 
sample and an acute myelogenous leukemia sample, 
wherein the gene expression profile of a) is indicative of mixed lineage 
10 leukemia, acute lymphoblastic leukemia or acute myelogenous leukemia. 

9. A method of diagnosing mixed lineage leukemia, comprising determining a 
gene expression profile of a gene expression product from at least one 
informative gene from one or more cells selected from the group consisting 
of mononuclear cells and bone marrow cells, wherein the gene expression 

1 5 profile is correlated with mixed lineage leukemia. 

10. A method of detecting and/or identifying a compound for use in treating 
mixed lineage leukemia, comprising: 

a) determining a gene expression profile of a gene expression product 
from at least one informative gene from one or more cells selected 

20 from the group consisting of mononuclear blood cells and bone 

marrow cells of an individual with mixed lineage leukemia; 

b) administering a test agent to the individual; 

c) determining a gene expression profile of a gene expression product 
from at least one informative gene from one or more cells selected 

25 from the group consisting of mononuclear blood cells and bone 

marrow cells from the individual of step b); and 

d) comparing the genes expression profiles of step a) to the gene 
expression profile of step c), 
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wherein if the gene expression profile from the individual after 
administration of the agent is correlated with effective treatment of mixed 
lineage leukemia, then the test agent is a therapeutic agent. 

The method of Claim 10, wherein a decrease in the expression of the 
informative gene selected from the group consisting of FLT3, MEIS1, and 
HoxA9 is indicative of effective treatment of mixed lineage leukemia. 

A method of detecting and/or identifying a compound for use in treating 
acute lymphoblastic leukemia, comprising: 

a) determining a gene expression profile of a gene expression product 
from at least one informative gene from one or more cells selected 
from the group consisting of mononuclear blood cells and bone 
marrow cells of an individual with acute lymphoblastic leukemia; 

b) administering a test agent to the individual; 

c) determining a gene expression profile of a gene expression product 
from at least one informative gene from one or more cells selected 
from the group consisting of mononuclear blood cells and bone 
marrow cells from the individual of step b); and 

d) comparing the genes expression profiles of step a) to the gene 
expression profile of step c), 

wherein if the gene expression profile from the individual after 
administration of the agent is correlated with effective treatment of acute 
lymphoblastic leukemia, then the test agent is a therapeutic agent. 

A method of detecting and/or identifying a compound for use in treating 
acute myelogenous leukemia, comprising: 

a) determining a gene expression profile of a gene expression product 
from at least one informative gene from one or more cells selected 
from the group consisting of mononuclear blood cells and bone 
marrow cells of an individual with acute myelogenous leukemia; 
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b) administering a test agent to the individual; 

c) determining a gene expression profile of a gene expression product 
from at least one informative gene from one or more cells selected 
from the group consisting of mononuclear blood cells and bone 
marrow cells from the individual of step b); and 

d) comparing the genes expression profiles of step a) to the gene 
expression profile of step c), 

wherein if the gene expression profile from the individual after 
acmiinistration of the agent is correlated with effective treatment of acute 
myelogenous leukemia, then the test agent is a therapeutic agent. 

A method of detecting and/or identifying a compound for use in treating 
mixed lineage leukemia, comprising: 

a) contacting a cell sample or lysate thereof with a candidate compound, 
wherein the cell is selected from the group consisting of mononuclear 
blood cells and bone marrow cells; and 

b) detecting an alteration of a gene expression profile of a gene 
expression product from at least one informative gene from the cell 
sample or lysate thereof, 

wherein a compound that increases the gene expression profile of at least one 
informative gene which is decreased in mixed lineage leukemia is a 
compound for use in treating mixed lineage leukemia. 

A method of detecting and/or identifying a compound for use in treating 
acute lymphoblastic leukemia, comprising: 

a) contacting a cell sample or lysate thereof with a candidate compound, 
wherein the cell is selected from the group consisting of mononuclear 
blood cells and bone marrow cells; and 

b) detecting an alteration of a gene expression profile of a gene 
expression product from at least one informative gene from the cell 
sample or lysate thereof, 
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wherein a compound that increases the gene expression profile of at least one 
informative gene which is decreased in acute lymphoblastic leukemia is a 
compound for use in treating acute lymphoblastic leukemia. 

16. A method of detecting and/or identifying a compound for use in treating 
5 acute myelogenous leukemia, comprising: 

a) contacting a cell sample or lysate thereof with a candidate compound, 
wherein the cell is selected from the group consisting of mononuclear 
blood cells and bone marrow cells; and 

b) detecting an alteration of a gene expression profile of a gene 

10 expression product from at least one informative gene from the cell 

sample or lysate thereof, 
wherein a compound that increases the gene expression profile of at least one 
informative gene which is decreased in acute myelogenous leukemia is a 
compound for use in treating acute myelogenous leukemia. 

15 17. A method of detecting and/or identifying a compound for use in treating 
mixed lineage leukemia, comprising: 

a) contacting a cell sample or lysate thereof with a candidate compound, 
wherein the cell is selected from the group consisting of mononuclear 
blood cells and bone marrow cells; and 
20 b) detecting an alteration of a gene expression profile of a gene 

expression product from at least one informative gene from the cell 
sample or lysate thereof, 
wherein a compound that decreases the gene expression profile of at least 
one informative gene which is increased in mixed lineage leukemia is a 
25 compound for use in treating mixed lineage leukemia. 

1 8 . The method of Claim 1 7, wherein the informative gene is selected from the 
group consisting of 'FLT3, MEIS1, and HoxA9. 
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19. A method of detecting and/or identifying a compound for use in treating 
acute lymphoblastic leukemia, comprising: 

a) contacting a cell sample or lysate thereof with a candidate compound, 
wherein the cell is selected from the group consisting of mononuclear 
blood cells and bone marrow cells; and 

b) detecting an alteration of a gene expression profile of a gene 
expression product from at least one informative gene from the cell 
sample or lysate thereof, 

wherein a compound that decreases the gene expression profile of at least 
one informative gene which is increased in acute lymphoblastic leukemia is a 
compound for use in treating acute lymphoblastic leukemia. 

20. A method of detecting and/or identifying a compound for use in treating 
acute myelogenous leukemia, comprising: 

a) contacting a cell sample or lysate thereof with a candidate compound, 
wherein the cell is selected from the group consisting of mononuclear 
blood cells and bone marrow cells; and 

b) detecting an alteration of a gene expression profile of a gene 
expression product from at least one informative gene from the cell 
sample or lysate thereof, 

wherein a compound that decreases the gene expression profile of at least 
one informative gene which is increased in acute myelogenous leukemia is a 
compound for use in treating acute myelogenous leukemia. 

A method for evaluating a drug candidate for effectiveness in treating mixed 
lineage leukemia, comprising determining a gene expression profile of a 
gene expression product from at least one informative gene, wherein tie gene 
expression product is isolated from cells derived from the group consisting 
of a blood or bone marrow sample from an individual to whom the drug 
candidate has been administered, wherein the gene expression profile is 
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indicative of the effectiveness of the drug candidate in treating mixed lineage 
leukemia. 

22. A method for evaluating a drug candidate for effectiveness in treating acute 
lymphoblastic leukemia, comprising determining a gene expression profile of 

5 a gene expression product from at least one informative gene, the gene 

expression product isolated from cells derived from the group consisting of a 
blood or bone marrow sample from an individual to whom the drug 
candidate has been administered, wherein the gene expression profile is 
indicative of the effectiveness of the drug candidate in treating acute 
10 lymphoblastic leukemia. 

23. A method for evaluating a drug candidate for effectiveness in treating acute 
myelogenous leukemia, comprising determining a gene expression profile of 
a gene expression product from at least one informative gene, the gene 
expression product isolated from cells derived from the group consisting of a 

15 blood or bone marrow sample from an individual to whom the drug 

candidate has been administered, wherein the gene expression profile is 
indicative of the effectiveness of the drug candidate in treating acute 
myelogenous leukemia. 

A method for monitoring the efficacy of a mixed lineage leukemia treatment, 
comprising: 

a) determining the gene expression profile of a gene expression product 
from at least one informative gene in a cell from blood samples 
derived from an individual being treated, wherein the samples are 
obtained at various time points; and 

b) comparing the treatment outcome of the samples at various times 
during treatment, 

wherein the efficacy of mixed lineage leukemia treatment is determined. 
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25 . A method for monitoring the efficacy of an acute lymphoblastic leukemia 
treatment, comprising: 

a) determining the gene expression profile of a gene expression product 
from at least one informative gene in a cell from blood samples 

5 derived from an individual being treated, wherein the samples are 

obtained at various time points; and 

b) comparing the treatment outcome of the samples at various times 
during treatment, 

wherein the efficacy of acute lymphoblastic leukemia treatment is 
10 determined. 

26. A method for monitoring the efficacy of an acute myelogenous leukemia 
treatment, comprising: 

a) determining the gene expression profile of a gene expression product 
from at least one informative gene in a cell from blood samples 

15 derived from an individual being treated, wherein the samples are 

obtained at various time points; and 

b) comparing the treatment outcome of the samples at various times 
during treatment, 

wherein the efficacy of acute myelogenous leukemia treatment is determined. 

A method of predicting the efficacy of treating mixed lineage leukemia, 
comprising determining a gene expression profile of a gene expression 
product from at least one informative gene, the gene expression product 
isolated from one or more cells selected from the group consisting of 
mononuclear cells and bone marrow cells of an individual with mixed 
lineage leukemia, wherein the gene expression profile is correlated with a 
treatment outcome, thereby classifying the sample with respect to treatment 
outcome. 
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A method of predicting the efficacy of treating acute lymphoblastic 
leukemia, comprising determining a gene expression profile of a gene 
expression product from at least one informative gene, the gene expression 
product isolated from one or more cells selected from the group consisting of 
mononuclear cells and bone marrow cells of an individual with acute 
lymphoblastic leukemia, wherein the gene expression profile is correlated 
with a treatment outcome, thereby classifying the sample with respect to 
treatment outcome. 

A method of predicting the efficacy of treating acute myelogenous leukemia, 
comprising determining a gene expression profile of a gene expression 
product from at least one informative gene, the gene expression product 
isolated from one or more cells selected from the group consisting of 
mononuclear cells and bone marrow cells of an individual with acute 
myelogenous leukemia, wherein the gene expression profile is correlated 
with a treatment outcome, thereby classifying the sample with respect to 
treatment outcome. 

A method of treating mixed lineage leukemia, comprising administering to 
an individual in need thereof a therapeutic amount of an agent that inhibits 
the activity of a gene product which is increased in mixed lineage leukemia 

A method of treating acute lymphoblastic leukemia, comprising 
adbninistering to an individual in need thereof a therapeutic amount of an 
agent that inhibits the activity of a gene product which is increased in acute 
lymphoblastic leukemia. 

A method of treating acute myelogenous leukemia, comprising administering 
to an individual in need thereof a therapeutic amount of an agent that inhibits 
the activity of a gene product that is increased in acute myelogenous 
leukemia. 
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33. A method of treating mixed lineage leukemia, comprising administering to 
an individual in need thereof a therapeutic amount of an agent that enhances 
the activity of a gene product that is decreased in mixed lineage leukemia. 

34. A method of treating acute lymphoblastic leukemia, comprising 
administering to an individual in need thereof a therapeutic amount of an 
agent that enhances the activity of a gene product that is decreased in acute 
lymphoblastic leukemia. 

35. A method of treating acute myelogenous leukemia, comprising acmiinistering 
to an individual in need thereof a therapeutic amount of an agent mat 
enhances the activity of a gene product that is decreased in acute 
myelogenous leukemia 

36. An oligonucleotide microarray having immobilized thereon a plurality of 
oligonucleotide probes specific for one or more informative genes for 
diagnosing mixed lineage leukemia, acute lymphoblastic leukemia, or acute 
myelogenous leukemia wherein the informative genes are selected from the 
group consisting of the genes in FIGs. 1A, IB, 2A-2F, 3A-3D, and 5, and 
Tables land 2. 
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